Using a priori knowledge to align sequencing reads to their exact genomic position

Bibliographic Details
Main Authors: Böttcher, René, Amberg, Ronny, Ruzius, F. P., Guryev, V., Verhaegh, Wim F. J., Beyerlein, Peter, van der Zaag, P. J.
Format: Online Article Text
Language: English
Published: Oxford University Press 2012
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3439880/
https://www.ncbi.nlm.nih.gov/pubmed/22581774
http://dx.doi.org/10.1093/nar/gks393
Description
Summary: The use of a priori knowledge in the alignment of targeted sequencing data is investigated using computational experiments. By adapting a Needleman–Wunsch algorithm to incorporate the genomic position information from the targeted capture, we demonstrate that alignment can be done to just the target region of interest. When direct string comparison is used in addition, alignment is up to a factor of 8 faster than with the fastest conventional aligner (Bowtie). This results in a total alignment time in targeted sequencing of around 7 min for approximately 56 million captured reads. For conventional aligners such as Bowtie, BWA or MAQ, alignment to just the target region is not feasible, as experiments show that it leads to an additional 88% SNP calls, the vast majority of which (∼92%) are false positives.
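
The two-stage idea in the summary (direct string comparison first, Needleman–Wunsch only when that fails) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the scoring values (match=1, mismatch=-1, gap=-2), the mismatch threshold and the `pad` parameter are all assumptions chosen for illustration, and the expected start position stands in for the a priori position information from the targeted capture.

```python
def needleman_wunsch(read, ref, match=1, mismatch=-1, gap=-2):
    """Global alignment score of read vs ref (textbook NW, linear gap cost)."""
    # prev holds the previous DP row; row 0 is all gaps in the read.
    prev = [j * gap for j in range(len(ref) + 1)]
    for i in range(1, len(read) + 1):
        curr = [i * gap]  # column 0 is all gaps in the reference
        for j in range(1, len(ref) + 1):
            sub = match if read[i - 1] == ref[j - 1] else mismatch
            curr.append(max(prev[j - 1] + sub,   # align the two bases
                            prev[j] + gap,       # gap in the reference
                            curr[j - 1] + gap))  # gap in the read
        prev = curr
    return prev[-1]


def align_to_target(read, reference, expected_start,
                    max_mismatches=2, pad=1,
                    match=1, mismatch=-1, gap=-2):
    """Align a read at its expected position (hypothetical helper).

    Stage 1: direct string comparison against the expected window.
    Stage 2: Needleman-Wunsch against a slightly padded window.
    Returns (method, score).
    """
    window = reference[expected_start:expected_start + len(read)]
    if len(window) == len(read):
        mismatches = sum(a != b for a, b in zip(read, window))
        if mismatches <= max_mismatches:
            score = (len(read) - mismatches) * match + mismatches * mismatch
            return ("direct", score)
    # Fallback: allow indels by aligning against a padded window.
    target = reference[expected_start:expected_start + len(read) + pad]
    return ("needleman-wunsch",
            needleman_wunsch(read, target, match, mismatch, gap))


reference = "ACGTACGTTAGCCGATAGGCT"
exact_read = reference[4:12]   # "ACGTTAGC", no variation
deleted_read = "ACGTAGC"       # same region with one base deleted

print(align_to_target(exact_read, reference, 4))
print(align_to_target(deleted_read, reference, 4))
```

In this sketch the exact read is resolved by the cheap comparison alone, while the read with a deletion shifts out of register, exceeds the mismatch threshold and falls through to the dynamic-programming step. Restricting both stages to a window around the expected position is what keeps the search confined to the target region.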