Cargando…
Fast and accurate long-read alignment with Burrows–Wheeler transform
Motivation: Many programs for aligning short sequencing reads to a reference genome have been developed in the last 2 years. Most of them are very efficient for short reads but inefficient or not applicable for reads >200 bp because the algorithms are heavily and specifically tuned for short quer...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828108/ https://www.ncbi.nlm.nih.gov/pubmed/20080505 http://dx.doi.org/10.1093/bioinformatics/btp698 |
_version_ | 1782177992637677568 |
---|---|
author | Li, Heng Durbin, Richard |
author_facet | Li, Heng Durbin, Richard |
author_sort | Li, Heng |
collection | PubMed |
description | Motivation: Many programs for aligning short sequencing reads to a reference genome have been developed in the last 2 years. Most of them are very efficient for short reads but inefficient or not applicable for reads >200 bp because the algorithms are heavily and specifically tuned for short queries with low sequencing error rate. However, some sequencing platforms already produce longer reads and others are expected to become available soon. For longer reads, hashing-based software such as BLAT and SSAHA2 remain the only choices. Nonetheless, these methods are substantially slower than short-read aligners in terms of aligned bases per unit time. Results: We designed and implemented a new algorithm, Burrows-Wheeler Aligner's Smith-Waterman Alignment (BWA-SW), to align long sequences up to 1 Mb against a large sequence database (e.g. the human genome) with a few gigabytes of memory. The algorithm is as accurate as SSAHA2, more accurate than BLAT, and is several to tens of times faster than both. Availability: http://bio-bwa.sourceforge.net Contact: rd@sanger.ac.uk |
format | Text |
id | pubmed-2828108 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-28281082010-02-25 Fast and accurate long-read alignment with Burrows–Wheeler transform Li, Heng Durbin, Richard Bioinformatics Original Papers Motivation: Many programs for aligning short sequencing reads to a reference genome have been developed in the last 2 years. Most of them are very efficient for short reads but inefficient or not applicable for reads >200 bp because the algorithms are heavily and specifically tuned for short queries with low sequencing error rate. However, some sequencing platforms already produce longer reads and others are expected to become available soon. For longer reads, hashing-based software such as BLAT and SSAHA2 remain the only choices. Nonetheless, these methods are substantially slower than short-read aligners in terms of aligned bases per unit time. Results: We designed and implemented a new algorithm, Burrows-Wheeler Aligner's Smith-Waterman Alignment (BWA-SW), to align long sequences up to 1 Mb against a large sequence database (e.g. the human genome) with a few gigabytes of memory. The algorithm is as accurate as SSAHA2, more accurate than BLAT, and is several to tens of times faster than both. Availability: http://bio-bwa.sourceforge.net Contact: rd@sanger.ac.uk Oxford University Press 2010-03-01 2010-01-15 /pmc/articles/PMC2828108/ /pubmed/20080505 http://dx.doi.org/10.1093/bioinformatics/btp698 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Li, Heng Durbin, Richard Fast and accurate long-read alignment with Burrows–Wheeler transform |
title | Fast and accurate long-read alignment with Burrows–Wheeler transform |
title_full | Fast and accurate long-read alignment with Burrows–Wheeler transform |
title_fullStr | Fast and accurate long-read alignment with Burrows–Wheeler transform |
title_full_unstemmed | Fast and accurate long-read alignment with Burrows–Wheeler transform |
title_short | Fast and accurate long-read alignment with Burrows–Wheeler transform |
title_sort | fast and accurate long-read alignment with burrows–wheeler transform |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828108/ https://www.ncbi.nlm.nih.gov/pubmed/20080505 http://dx.doi.org/10.1093/bioinformatics/btp698 |
work_keys_str_mv | AT liheng fastandaccuratelongreadalignmentwithburrowswheelertransform AT durbinrichard fastandaccuratelongreadalignmentwithburrowswheelertransform |