Cargando…

Kart: a divide-and-conquer algorithm for NGS read alignment

MOTIVATION: Next-generation sequencing (NGS) provides a great opportunity to investigate genome-wide variation at nucleotide resolution. Due to the huge amount of data, NGS applications require very fast and accurate alignment algorithms. Most existing algorithms for read mapping basically adopt see...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Hsin-Nan, Hsu, Wen-Lian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860120/
https://www.ncbi.nlm.nih.gov/pubmed/28379292
http://dx.doi.org/10.1093/bioinformatics/btx189
_version_ 1783307948974407680
author Lin, Hsin-Nan
Hsu, Wen-Lian
author_facet Lin, Hsin-Nan
Hsu, Wen-Lian
author_sort Lin, Hsin-Nan
collection PubMed
description MOTIVATION: Next-generation sequencing (NGS) provides a great opportunity to investigate genome-wide variation at nucleotide resolution. Due to the huge amount of data, NGS applications require very fast and accurate alignment algorithms. Most existing algorithms for read mapping basically adopt seed-and-extend strategy, which is sequential in nature and takes much longer time on longer reads. RESULTS: We develop a divide-and-conquer algorithm, called Kart, which can process long reads as fast as short reads by dividing a read into small fragments that can be aligned independently. Our experiment result indicates that the average size of fragments requiring the more time-consuming gapped alignment is around 20 bp regardless of the original read length. Furthermore, it can tolerate much higher error rates. The experiments show that Kart spends much less time on longer reads than other aligners and still produce reliable alignments even when the error rate is as high as 15%. AVAILABILITY AND IMPLEMENTATION: Kart is available at https://github.com/hsinnan75/Kart/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5860120
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58601202018-03-23 Kart: a divide-and-conquer algorithm for NGS read alignment Lin, Hsin-Nan Hsu, Wen-Lian Bioinformatics Original Papers MOTIVATION: Next-generation sequencing (NGS) provides a great opportunity to investigate genome-wide variation at nucleotide resolution. Due to the huge amount of data, NGS applications require very fast and accurate alignment algorithms. Most existing algorithms for read mapping basically adopt seed-and-extend strategy, which is sequential in nature and takes much longer time on longer reads. RESULTS: We develop a divide-and-conquer algorithm, called Kart, which can process long reads as fast as short reads by dividing a read into small fragments that can be aligned independently. Our experiment result indicates that the average size of fragments requiring the more time-consuming gapped alignment is around 20 bp regardless of the original read length. Furthermore, it can tolerate much higher error rates. The experiments show that Kart spends much less time on longer reads than other aligners and still produce reliable alignments even when the error rate is as high as 15%. AVAILABILITY AND IMPLEMENTATION: Kart is available at https://github.com/hsinnan75/Kart/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-08-01 2017-04-04 /pmc/articles/PMC5860120/ /pubmed/28379292 http://dx.doi.org/10.1093/bioinformatics/btx189 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Lin, Hsin-Nan
Hsu, Wen-Lian
Kart: a divide-and-conquer algorithm for NGS read alignment
title Kart: a divide-and-conquer algorithm for NGS read alignment
title_full Kart: a divide-and-conquer algorithm for NGS read alignment
title_fullStr Kart: a divide-and-conquer algorithm for NGS read alignment
title_full_unstemmed Kart: a divide-and-conquer algorithm for NGS read alignment
title_short Kart: a divide-and-conquer algorithm for NGS read alignment
title_sort kart: a divide-and-conquer algorithm for ngs read alignment
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860120/
https://www.ncbi.nlm.nih.gov/pubmed/28379292
http://dx.doi.org/10.1093/bioinformatics/btx189
work_keys_str_mv AT linhsinnan kartadivideandconqueralgorithmforngsreadalignment
AT hsuwenlian kartadivideandconqueralgorithmforngsreadalignment