Cargando…

Probabilistic approaches to alignment with tandem repeats

BACKGROUND: Short tandem repeats are ubiquitous in genomic sequences and due to their complex evolutionary history pose a challenge for sequence alignment tools. RESULTS: To better account for the presence of tandem repeats in pairwise sequence alignments, we propose a simple tractable pair hidden M...

Descripción completa

Detalles Bibliográficos
Autores principales: Nánási, Michal, Vinař, Tomáš, Brejová, Broňa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3975930/
https://www.ncbi.nlm.nih.gov/pubmed/24580741
http://dx.doi.org/10.1186/1748-7188-9-3
_version_ 1782310211086712832
author Nánási, Michal
Vinař, Tomáš
Brejová, Broňa
author_facet Nánási, Michal
Vinař, Tomáš
Brejová, Broňa
author_sort Nánási, Michal
collection PubMed
description BACKGROUND: Short tandem repeats are ubiquitous in genomic sequences and due to their complex evolutionary history pose a challenge for sequence alignment tools. RESULTS: To better account for the presence of tandem repeats in pairwise sequence alignments, we propose a simple tractable pair hidden Markov model that explicitly models their presence. Using the framework of gain functions, we design several optimization criteria for decoding this model and describe resulting decoding algorithms, ranging from the traditional Viterbi and posterior decoding to block-based decoding algorithms tailored to our model. We compare the accuracy of individual decoding algorithms on simulated and real data and find that our approach is superior to the classical three-state pair HMM. CONCLUSIONS: Our study illustrates versatility of pair hidden Markov models coupled with appropriate decoding criteria as a modeling tool for capturing complex sequence features.
format Online
Article
Text
id pubmed-3975930
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39759302014-04-17 Probabilistic approaches to alignment with tandem repeats Nánási, Michal Vinař, Tomáš Brejová, Broňa Algorithms Mol Biol Research BACKGROUND: Short tandem repeats are ubiquitous in genomic sequences and due to their complex evolutionary history pose a challenge for sequence alignment tools. RESULTS: To better account for the presence of tandem repeats in pairwise sequence alignments, we propose a simple tractable pair hidden Markov model that explicitly models their presence. Using the framework of gain functions, we design several optimization criteria for decoding this model and describe resulting decoding algorithms, ranging from the traditional Viterbi and posterior decoding to block-based decoding algorithms tailored to our model. We compare the accuracy of individual decoding algorithms on simulated and real data and find that our approach is superior to the classical three-state pair HMM. CONCLUSIONS: Our study illustrates versatility of pair hidden Markov models coupled with appropriate decoding criteria as a modeling tool for capturing complex sequence features. BioMed Central 2014-03-01 /pmc/articles/PMC3975930/ /pubmed/24580741 http://dx.doi.org/10.1186/1748-7188-9-3 Text en Copyright © 2014 Nánási et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Nánási, Michal
Vinař, Tomáš
Brejová, Broňa
Probabilistic approaches to alignment with tandem repeats
title Probabilistic approaches to alignment with tandem repeats
title_full Probabilistic approaches to alignment with tandem repeats
title_fullStr Probabilistic approaches to alignment with tandem repeats
title_full_unstemmed Probabilistic approaches to alignment with tandem repeats
title_short Probabilistic approaches to alignment with tandem repeats
title_sort probabilistic approaches to alignment with tandem repeats
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3975930/
https://www.ncbi.nlm.nih.gov/pubmed/24580741
http://dx.doi.org/10.1186/1748-7188-9-3
work_keys_str_mv AT nanasimichal probabilisticapproachestoalignmentwithtandemrepeats
AT vinartomas probabilisticapproachestoalignmentwithtandemrepeats
AT brejovabrona probabilisticapproachestoalignmentwithtandemrepeats