Cargando…

Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory

BACKGROUND: Recent methods have been developed to perform high-throughput sequencing of DNA by Single Molecule Sequencing (SMS). While Next-Generation sequencing methods may produce reads up to several hundred bases long, SMS sequencing produces reads up to tens of kilobases long. Existing alignment...

Descripción completa

Detalles Bibliográficos
Autores principales: Chaisson, Mark J, Tesler, Glenn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3572422/
https://www.ncbi.nlm.nih.gov/pubmed/22988817
http://dx.doi.org/10.1186/1471-2105-13-238
_version_ 1782259322177191936
author Chaisson, Mark J
Tesler, Glenn
author_facet Chaisson, Mark J
Tesler, Glenn
author_sort Chaisson, Mark J
collection PubMed
description BACKGROUND: Recent methods have been developed to perform high-throughput sequencing of DNA by Single Molecule Sequencing (SMS). While Next-Generation sequencing methods may produce reads up to several hundred bases long, SMS sequencing produces reads up to tens of kilobases long. Existing alignment methods are either too inefficient for high-throughput datasets, or not sensitive enough to align SMS reads, which have a higher error rate than Next-Generation sequencing. RESULTS: We describe the method BLASR (Basic Local Alignment with Successive Refinement) for mapping Single Molecule Sequencing (SMS) reads that are thousands of bases long, with divergence between the read and genome dominated by insertion and deletion error. The method is benchmarked using both simulated reads and reads from a bacterial sequencing project. We also present a combinatorial model of sequencing error that motivates why our approach is effective. CONCLUSIONS: The results indicate that it is possible to map SMS reads with high accuracy and speed. Furthermore, the inferences made on the mapability of SMS reads using our combinatorial model of sequencing error are in agreement with the mapping accuracy demonstrated on simulated reads.
format Online
Article
Text
id pubmed-3572422
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35724222013-02-15 Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory Chaisson, Mark J Tesler, Glenn BMC Bioinformatics Methodology Article BACKGROUND: Recent methods have been developed to perform high-throughput sequencing of DNA by Single Molecule Sequencing (SMS). While Next-Generation sequencing methods may produce reads up to several hundred bases long, SMS sequencing produces reads up to tens of kilobases long. Existing alignment methods are either too inefficient for high-throughput datasets, or not sensitive enough to align SMS reads, which have a higher error rate than Next-Generation sequencing. RESULTS: We describe the method BLASR (Basic Local Alignment with Successive Refinement) for mapping Single Molecule Sequencing (SMS) reads that are thousands of bases long, with divergence between the read and genome dominated by insertion and deletion error. The method is benchmarked using both simulated reads and reads from a bacterial sequencing project. We also present a combinatorial model of sequencing error that motivates why our approach is effective. CONCLUSIONS: The results indicate that it is possible to map SMS reads with high accuracy and speed. Furthermore, the inferences made on the mapability of SMS reads using our combinatorial model of sequencing error are in agreement with the mapping accuracy demonstrated on simulated reads. BioMed Central 2012-09-19 /pmc/articles/PMC3572422/ /pubmed/22988817 http://dx.doi.org/10.1186/1471-2105-13-238 Text en Copyright ©2012 Chaisson and Tesler; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Chaisson, Mark J
Tesler, Glenn
Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory
title Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory
title_full Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory
title_fullStr Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory
title_full_unstemmed Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory
title_short Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory
title_sort mapping single molecule sequencing reads using basic local alignment with successive refinement (blasr): application and theory
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3572422/
https://www.ncbi.nlm.nih.gov/pubmed/22988817
http://dx.doi.org/10.1186/1471-2105-13-238
work_keys_str_mv AT chaissonmarkj mappingsinglemoleculesequencingreadsusingbasiclocalalignmentwithsuccessiverefinementblasrapplicationandtheory
AT teslerglenn mappingsinglemoleculesequencingreadsusingbasiclocalalignmentwithsuccessiverefinementblasrapplicationandtheory