Cargando…
Using a priori knowledge to align sequencing reads to their exact genomic position
The use of a priori knowledge in the alignment of targeted sequencing data is investigated using computational experiments. Adapting a Needleman–Wunsch algorithm to incorporate the genomic position information from the targeted capture, we demonstrate that alignment can be done to just the target re...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3439880/ https://www.ncbi.nlm.nih.gov/pubmed/22581774 http://dx.doi.org/10.1093/nar/gks393 |
_version_ | 1782243081854124032 |
---|---|
author | Böttcher, René Amberg, Ronny Ruzius, F. P. Guryev, V. Verhaegh, Wim F. J. Beyerlein, Peter van der Zaag, P. J. |
author_facet | Böttcher, René Amberg, Ronny Ruzius, F. P. Guryev, V. Verhaegh, Wim F. J. Beyerlein, Peter van der Zaag, P. J. |
author_sort | Böttcher, René |
collection | PubMed |
description | The use of a priori knowledge in the alignment of targeted sequencing data is investigated using computational experiments. Adapting a Needleman–Wunsch algorithm to incorporate the genomic position information from the targeted capture, we demonstrate that alignment can be done to just the target region of interest. When in addition use is made of direct string comparison, an improvement of up to a factor of 8 in alignment speed compared to the fastest conventional aligner (Bowtie) is obtained. This results in a total alignment time in targeted sequencing of around 7 min for aligning approximately 56 million captured reads. For conventional aligners such as Bowtie, BWA or MAQ, alignment to just the target region is not feasible as experiments show that this leads to an additional 88% SNP calls, the vast majority of which are false positives (∼92%). |
format | Online Article Text |
id | pubmed-3439880 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34398802012-09-12 Using a priori knowledge to align sequencing reads to their exact genomic position Böttcher, René Amberg, Ronny Ruzius, F. P. Guryev, V. Verhaegh, Wim F. J. Beyerlein, Peter van der Zaag, P. J. Nucleic Acids Res Methods Online The use of a priori knowledge in the alignment of targeted sequencing data is investigated using computational experiments. Adapting a Needleman–Wunsch algorithm to incorporate the genomic position information from the targeted capture, we demonstrate that alignment can be done to just the target region of interest. When in addition use is made of direct string comparison, an improvement of up to a factor of 8 in alignment speed compared to the fastest conventional aligner (Bowtie) is obtained. This results in a total alignment time in targeted sequencing of around 7 min for aligning approximately 56 million captured reads. For conventional aligners such as Bowtie, BWA or MAQ, alignment to just the target region is not feasible as experiments show that this leads to an additional 88% SNP calls, the vast majority of which are false positives (∼92%). Oxford University Press 2012-09 2012-05-11 /pmc/articles/PMC3439880/ /pubmed/22581774 http://dx.doi.org/10.1093/nar/gks393 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Böttcher, René Amberg, Ronny Ruzius, F. P. Guryev, V. Verhaegh, Wim F. J. Beyerlein, Peter van der Zaag, P. J. Using a priori knowledge to align sequencing reads to their exact genomic position |
title | Using a priori knowledge to align sequencing reads to their exact genomic position |
title_full | Using a priori knowledge to align sequencing reads to their exact genomic position |
title_fullStr | Using a priori knowledge to align sequencing reads to their exact genomic position |
title_full_unstemmed | Using a priori knowledge to align sequencing reads to their exact genomic position |
title_short | Using a priori knowledge to align sequencing reads to their exact genomic position |
title_sort | using a priori knowledge to align sequencing reads to their exact genomic position |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3439880/ https://www.ncbi.nlm.nih.gov/pubmed/22581774 http://dx.doi.org/10.1093/nar/gks393 |
work_keys_str_mv | AT bottcherrene usingaprioriknowledgetoalignsequencingreadstotheirexactgenomicposition AT ambergronny usingaprioriknowledgetoalignsequencingreadstotheirexactgenomicposition AT ruziusfp usingaprioriknowledgetoalignsequencingreadstotheirexactgenomicposition AT guryevv usingaprioriknowledgetoalignsequencingreadstotheirexactgenomicposition AT verhaeghwimfj usingaprioriknowledgetoalignsequencingreadstotheirexactgenomicposition AT beyerleinpeter usingaprioriknowledgetoalignsequencingreadstotheirexactgenomicposition AT vanderzaagpj usingaprioriknowledgetoalignsequencingreadstotheirexactgenomicposition |