Cargando…

Fast and accurate long-read alignment with Burrows–Wheeler transform

Motivation: Many programs for aligning short sequencing reads to a reference genome have been developed in the last 2 years. Most of them are very efficient for short reads but inefficient or not applicable for reads >200 bp because the algorithms are heavily and specifically tuned for short quer...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Heng, Durbin, Richard
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828108/
https://www.ncbi.nlm.nih.gov/pubmed/20080505
http://dx.doi.org/10.1093/bioinformatics/btp698
_version_ 1782177992637677568
author Li, Heng
Durbin, Richard
author_facet Li, Heng
Durbin, Richard
author_sort Li, Heng
collection PubMed
description Motivation: Many programs for aligning short sequencing reads to a reference genome have been developed in the last 2 years. Most of them are very efficient for short reads but inefficient or not applicable for reads >200 bp because the algorithms are heavily and specifically tuned for short queries with low sequencing error rate. However, some sequencing platforms already produce longer reads and others are expected to become available soon. For longer reads, hashing-based software such as BLAT and SSAHA2 remain the only choices. Nonetheless, these methods are substantially slower than short-read aligners in terms of aligned bases per unit time. Results: We designed and implemented a new algorithm, Burrows-Wheeler Aligner's Smith-Waterman Alignment (BWA-SW), to align long sequences up to 1 Mb against a large sequence database (e.g. the human genome) with a few gigabytes of memory. The algorithm is as accurate as SSAHA2, more accurate than BLAT, and is several to tens of times faster than both. Availability: http://bio-bwa.sourceforge.net Contact: rd@sanger.ac.uk
format Text
id pubmed-2828108
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28281082010-02-25 Fast and accurate long-read alignment with Burrows–Wheeler transform Li, Heng Durbin, Richard Bioinformatics Original Papers Motivation: Many programs for aligning short sequencing reads to a reference genome have been developed in the last 2 years. Most of them are very efficient for short reads but inefficient or not applicable for reads >200 bp because the algorithms are heavily and specifically tuned for short queries with low sequencing error rate. However, some sequencing platforms already produce longer reads and others are expected to become available soon. For longer reads, hashing-based software such as BLAT and SSAHA2 remain the only choices. Nonetheless, these methods are substantially slower than short-read aligners in terms of aligned bases per unit time. Results: We designed and implemented a new algorithm, Burrows-Wheeler Aligner's Smith-Waterman Alignment (BWA-SW), to align long sequences up to 1 Mb against a large sequence database (e.g. the human genome) with a few gigabytes of memory. The algorithm is as accurate as SSAHA2, more accurate than BLAT, and is several to tens of times faster than both. Availability: http://bio-bwa.sourceforge.net Contact: rd@sanger.ac.uk Oxford University Press 2010-03-01 2010-01-15 /pmc/articles/PMC2828108/ /pubmed/20080505 http://dx.doi.org/10.1093/bioinformatics/btp698 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Li, Heng
Durbin, Richard
Fast and accurate long-read alignment with Burrows–Wheeler transform
title Fast and accurate long-read alignment with Burrows–Wheeler transform
title_full Fast and accurate long-read alignment with Burrows–Wheeler transform
title_fullStr Fast and accurate long-read alignment with Burrows–Wheeler transform
title_full_unstemmed Fast and accurate long-read alignment with Burrows–Wheeler transform
title_short Fast and accurate long-read alignment with Burrows–Wheeler transform
title_sort fast and accurate long-read alignment with burrows–wheeler transform
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828108/
https://www.ncbi.nlm.nih.gov/pubmed/20080505
http://dx.doi.org/10.1093/bioinformatics/btp698
work_keys_str_mv AT liheng fastandaccuratelongreadalignmentwithburrowswheelertransform
AT durbinrichard fastandaccuratelongreadalignmentwithburrowswheelertransform