Cargando…
A short note on dynamic programming in a band
BACKGROUND: Third generation sequencing technologies generate long reads that exhibit high error rates, in particular for insertions and deletions which are usually the most difficult errors to cope with. The only exact algorithm capable of aligning sequences with insertions and deletions is a dynam...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003146/ https://www.ncbi.nlm.nih.gov/pubmed/29902968 http://dx.doi.org/10.1186/s12859-018-2228-9 |
_version_ | 1783332317331193856 |
---|---|
author | Gibrat, Jean-François |
author_facet | Gibrat, Jean-François |
author_sort | Gibrat, Jean-François |
collection | PubMed |
description | BACKGROUND: Third generation sequencing technologies generate long reads that exhibit high error rates, in particular for insertions and deletions which are usually the most difficult errors to cope with. The only exact algorithm capable of aligning sequences with insertions and deletions is a dynamic programming algorithm. RESULTS: In this note, for the sake of efficiency, we consider dynamic programming in a band. We show how to choose the band width in function of the long reads’ error rates, thus obtaining an [Formula: see text] algorithm in space and time. We also propose a procedure to decide whether this algorithm, when applied to semi-global alignments, provides the optimal score. CONCLUSIONS: We suggest that dynamic programming in a band is well suited to the problem of aligning long reads between themselves and can be used as a core component of methods for obtaining a consensus sequence from the long reads alone. The function implementing the dynamic programming algorithm in a band is available, as a standalone program, at: https://forgemia.inra.fr/jean-francois.gibrat/BAND_DYN_PROG.git |
format | Online Article Text |
id | pubmed-6003146 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-60031462018-07-06 A short note on dynamic programming in a band Gibrat, Jean-François BMC Bioinformatics Research Article BACKGROUND: Third generation sequencing technologies generate long reads that exhibit high error rates, in particular for insertions and deletions which are usually the most difficult errors to cope with. The only exact algorithm capable of aligning sequences with insertions and deletions is a dynamic programming algorithm. RESULTS: In this note, for the sake of efficiency, we consider dynamic programming in a band. We show how to choose the band width in function of the long reads’ error rates, thus obtaining an [Formula: see text] algorithm in space and time. We also propose a procedure to decide whether this algorithm, when applied to semi-global alignments, provides the optimal score. CONCLUSIONS: We suggest that dynamic programming in a band is well suited to the problem of aligning long reads between themselves and can be used as a core component of methods for obtaining a consensus sequence from the long reads alone. The function implementing the dynamic programming algorithm in a band is available, as a standalone program, at: https://forgemia.inra.fr/jean-francois.gibrat/BAND_DYN_PROG.git BioMed Central 2018-06-15 /pmc/articles/PMC6003146/ /pubmed/29902968 http://dx.doi.org/10.1186/s12859-018-2228-9 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Gibrat, Jean-François A short note on dynamic programming in a band |
title | A short note on dynamic programming in a band |
title_full | A short note on dynamic programming in a band |
title_fullStr | A short note on dynamic programming in a band |
title_full_unstemmed | A short note on dynamic programming in a band |
title_short | A short note on dynamic programming in a band |
title_sort | short note on dynamic programming in a band |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003146/ https://www.ncbi.nlm.nih.gov/pubmed/29902968 http://dx.doi.org/10.1186/s12859-018-2228-9 |
work_keys_str_mv | AT gibratjeanfrancois ashortnoteondynamicprogramminginaband AT gibratjeanfrancois shortnoteondynamicprogramminginaband |