Cargando…
RandAL: a randomized approach to aligning DNA sequences to reference genomes
BACKGROUND: The alignment of short reads generated by next-generation sequencers to genomes is an important problem in many biomedical and bioinformatics applications. Although many proposed methods work very well on narrow ranges of read lengths, they tend to suffer in performance and alignment qua...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4120144/ https://www.ncbi.nlm.nih.gov/pubmed/25081493 http://dx.doi.org/10.1186/1471-2164-15-S5-S2 |
_version_ | 1782329045393866752 |
---|---|
author | Vo, Nam S Tran, Quang Niraula, Nobal Phan, Vinhthuy |
author_facet | Vo, Nam S Tran, Quang Niraula, Nobal Phan, Vinhthuy |
author_sort | Vo, Nam S |
collection | PubMed |
description | BACKGROUND: The alignment of short reads generated by next-generation sequencers to genomes is an important problem in many biomedical and bioinformatics applications. Although many proposed methods work very well on narrow ranges of read lengths, they tend to suffer in performance and alignment quality for reads outside of these ranges. RESULTS: We introduce RandAL, a novel method that aligns DNA sequences to reference genomes. Our approach utilizes two FM indices to facilitate efficient bidirectional searching, a pruning heuristic to speed up the computing of edit distances, and most importantly, a randomized strategy that enables effective estimation of key parameters. Extensive comparisons showed that RandAL outperformed popular aligners in most instances and was unique in its consistent and accurate performance over a wide range of read lengths and error rates. The software package is publicly available at https://github.com/namsyvo/RandAL. CONCLUSIONS: RandAL promises to align effectively and accurately short reads that come from a variety of technologies with different read lengths and rates of sequencing error. |
format | Online Article Text |
id | pubmed-4120144 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41201442014-08-11 RandAL: a randomized approach to aligning DNA sequences to reference genomes Vo, Nam S Tran, Quang Niraula, Nobal Phan, Vinhthuy BMC Genomics Research BACKGROUND: The alignment of short reads generated by next-generation sequencers to genomes is an important problem in many biomedical and bioinformatics applications. Although many proposed methods work very well on narrow ranges of read lengths, they tend to suffer in performance and alignment quality for reads outside of these ranges. RESULTS: We introduce RandAL, a novel method that aligns DNA sequences to reference genomes. Our approach utilizes two FM indices to facilitate efficient bidirectional searching, a pruning heuristic to speed up the computing of edit distances, and most importantly, a randomized strategy that enables effective estimation of key parameters. Extensive comparisons showed that RandAL outperformed popular aligners in most instances and was unique in its consistent and accurate performance over a wide range of read lengths and error rates. The software package is publicly available at https://github.com/namsyvo/RandAL. CONCLUSIONS: RandAL promises to align effectively and accurately short reads that come from a variety of technologies with different read lengths and rates of sequencing error. BioMed Central 2014-07-14 /pmc/articles/PMC4120144/ /pubmed/25081493 http://dx.doi.org/10.1186/1471-2164-15-S5-S2 Text en Copyright © 2014 Vo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Vo, Nam S Tran, Quang Niraula, Nobal Phan, Vinhthuy RandAL: a randomized approach to aligning DNA sequences to reference genomes |
title | RandAL: a randomized approach to aligning DNA sequences to reference genomes |
title_full | RandAL: a randomized approach to aligning DNA sequences to reference genomes |
title_fullStr | RandAL: a randomized approach to aligning DNA sequences to reference genomes |
title_full_unstemmed | RandAL: a randomized approach to aligning DNA sequences to reference genomes |
title_short | RandAL: a randomized approach to aligning DNA sequences to reference genomes |
title_sort | randal: a randomized approach to aligning dna sequences to reference genomes |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4120144/ https://www.ncbi.nlm.nih.gov/pubmed/25081493 http://dx.doi.org/10.1186/1471-2164-15-S5-S2 |
work_keys_str_mv | AT vonams randalarandomizedapproachtoaligningdnasequencestoreferencegenomes AT tranquang randalarandomizedapproachtoaligningdnasequencestoreferencegenomes AT niraulanobal randalarandomizedapproachtoaligningdnasequencestoreferencegenomes AT phanvinhthuy randalarandomizedapproachtoaligningdnasequencestoreferencegenomes |