Cargando…

A short note on dynamic programming in a band

BACKGROUND: Third generation sequencing technologies generate long reads that exhibit high error rates, in particular for insertions and deletions which are usually the most difficult errors to cope with. The only exact algorithm capable of aligning sequences with insertions and deletions is a dynam...

Descripción completa

Detalles Bibliográficos
Autor principal: Gibrat, Jean-François
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003146/
https://www.ncbi.nlm.nih.gov/pubmed/29902968
http://dx.doi.org/10.1186/s12859-018-2228-9
_version_ 1783332317331193856
author Gibrat, Jean-François
author_facet Gibrat, Jean-François
author_sort Gibrat, Jean-François
collection PubMed
description BACKGROUND: Third generation sequencing technologies generate long reads that exhibit high error rates, in particular for insertions and deletions which are usually the most difficult errors to cope with. The only exact algorithm capable of aligning sequences with insertions and deletions is a dynamic programming algorithm. RESULTS: In this note, for the sake of efficiency, we consider dynamic programming in a band. We show how to choose the band width in function of the long reads’ error rates, thus obtaining an [Formula: see text] algorithm in space and time. We also propose a procedure to decide whether this algorithm, when applied to semi-global alignments, provides the optimal score. CONCLUSIONS: We suggest that dynamic programming in a band is well suited to the problem of aligning long reads between themselves and can be used as a core component of methods for obtaining a consensus sequence from the long reads alone. The function implementing the dynamic programming algorithm in a band is available, as a standalone program, at: https://forgemia.inra.fr/jean-francois.gibrat/BAND_DYN_PROG.git
format Online
Article
Text
id pubmed-6003146
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60031462018-07-06 A short note on dynamic programming in a band Gibrat, Jean-François BMC Bioinformatics Research Article BACKGROUND: Third generation sequencing technologies generate long reads that exhibit high error rates, in particular for insertions and deletions which are usually the most difficult errors to cope with. The only exact algorithm capable of aligning sequences with insertions and deletions is a dynamic programming algorithm. RESULTS: In this note, for the sake of efficiency, we consider dynamic programming in a band. We show how to choose the band width in function of the long reads’ error rates, thus obtaining an [Formula: see text] algorithm in space and time. We also propose a procedure to decide whether this algorithm, when applied to semi-global alignments, provides the optimal score. CONCLUSIONS: We suggest that dynamic programming in a band is well suited to the problem of aligning long reads between themselves and can be used as a core component of methods for obtaining a consensus sequence from the long reads alone. The function implementing the dynamic programming algorithm in a band is available, as a standalone program, at: https://forgemia.inra.fr/jean-francois.gibrat/BAND_DYN_PROG.git BioMed Central 2018-06-15 /pmc/articles/PMC6003146/ /pubmed/29902968 http://dx.doi.org/10.1186/s12859-018-2228-9 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Gibrat, Jean-François
A short note on dynamic programming in a band
title A short note on dynamic programming in a band
title_full A short note on dynamic programming in a band
title_fullStr A short note on dynamic programming in a band
title_full_unstemmed A short note on dynamic programming in a band
title_short A short note on dynamic programming in a band
title_sort short note on dynamic programming in a band
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003146/
https://www.ncbi.nlm.nih.gov/pubmed/29902968
http://dx.doi.org/10.1186/s12859-018-2228-9
work_keys_str_mv AT gibratjeanfrancois ashortnoteondynamicprogramminginaband
AT gibratjeanfrancois shortnoteondynamicprogramminginaband