Cargando…

The Embedding Problem for Markov Models of Nucleotide Substitution

Continuous-time Markov processes are often used to model the complex natural phenomenon of sequence evolution. To make the process of sequence evolution tractable, simplifying assumptions are often made about the sequence properties and the underlying process. The validity of one such assumption, ti...

Descripción completa

Detalles Bibliográficos
Autores principales: Verbyla, Klara L., Yap, Von Bing, Pahwa, Anuj, Shao, Yunli, Huttley, Gavin A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3728303/
https://www.ncbi.nlm.nih.gov/pubmed/23935949
http://dx.doi.org/10.1371/journal.pone.0069187
_version_ 1782278837013315584
author Verbyla, Klara L.
Yap, Von Bing
Pahwa, Anuj
Shao, Yunli
Huttley, Gavin A.
author_facet Verbyla, Klara L.
Yap, Von Bing
Pahwa, Anuj
Shao, Yunli
Huttley, Gavin A.
author_sort Verbyla, Klara L.
collection PubMed
description Continuous-time Markov processes are often used to model the complex natural phenomenon of sequence evolution. To make the process of sequence evolution tractable, simplifying assumptions are often made about the sequence properties and the underlying process. The validity of one such assumption, time-homogeneity, has never been explored. Violations of this assumption can be found by identifying non-embeddability. A process is non-embeddable if it can not be embedded in a continuous time-homogeneous Markov process. In this study, non-embeddability was demonstrated to exist when modelling sequence evolution with Markov models. Evidence of non-embeddability was found primarily at the third codon position, possibly resulting from changes in mutation rate over time. Outgroup edges and those with a deeper time depth were found to have an increased probability of the underlying process being non-embeddable. Overall, low levels of non-embeddability were detected when examining individual edges of triads across a diverse set of alignments. Subsequent phylogenetic reconstruction analyses demonstrated that non-embeddability could impact on the correct prediction of phylogenies, but at extremely low levels. Despite the existence of non-embeddability, there is minimal evidence of violations of the local time homogeneity assumption and consequently the impact is likely to be minor.
format Online
Article
Text
id pubmed-3728303
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37283032013-08-09 The Embedding Problem for Markov Models of Nucleotide Substitution Verbyla, Klara L. Yap, Von Bing Pahwa, Anuj Shao, Yunli Huttley, Gavin A. PLoS One Research Article Continuous-time Markov processes are often used to model the complex natural phenomenon of sequence evolution. To make the process of sequence evolution tractable, simplifying assumptions are often made about the sequence properties and the underlying process. The validity of one such assumption, time-homogeneity, has never been explored. Violations of this assumption can be found by identifying non-embeddability. A process is non-embeddable if it can not be embedded in a continuous time-homogeneous Markov process. In this study, non-embeddability was demonstrated to exist when modelling sequence evolution with Markov models. Evidence of non-embeddability was found primarily at the third codon position, possibly resulting from changes in mutation rate over time. Outgroup edges and those with a deeper time depth were found to have an increased probability of the underlying process being non-embeddable. Overall, low levels of non-embeddability were detected when examining individual edges of triads across a diverse set of alignments. Subsequent phylogenetic reconstruction analyses demonstrated that non-embeddability could impact on the correct prediction of phylogenies, but at extremely low levels. Despite the existence of non-embeddability, there is minimal evidence of violations of the local time homogeneity assumption and consequently the impact is likely to be minor. Public Library of Science 2013-07-30 /pmc/articles/PMC3728303/ /pubmed/23935949 http://dx.doi.org/10.1371/journal.pone.0069187 Text en © 2013 Verbyla et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Verbyla, Klara L.
Yap, Von Bing
Pahwa, Anuj
Shao, Yunli
Huttley, Gavin A.
The Embedding Problem for Markov Models of Nucleotide Substitution
title The Embedding Problem for Markov Models of Nucleotide Substitution
title_full The Embedding Problem for Markov Models of Nucleotide Substitution
title_fullStr The Embedding Problem for Markov Models of Nucleotide Substitution
title_full_unstemmed The Embedding Problem for Markov Models of Nucleotide Substitution
title_short The Embedding Problem for Markov Models of Nucleotide Substitution
title_sort embedding problem for markov models of nucleotide substitution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3728303/
https://www.ncbi.nlm.nih.gov/pubmed/23935949
http://dx.doi.org/10.1371/journal.pone.0069187
work_keys_str_mv AT verbylaklaral theembeddingproblemformarkovmodelsofnucleotidesubstitution
AT yapvonbing theembeddingproblemformarkovmodelsofnucleotidesubstitution
AT pahwaanuj theembeddingproblemformarkovmodelsofnucleotidesubstitution
AT shaoyunli theembeddingproblemformarkovmodelsofnucleotidesubstitution
AT huttleygavina theembeddingproblemformarkovmodelsofnucleotidesubstitution
AT verbylaklaral embeddingproblemformarkovmodelsofnucleotidesubstitution
AT yapvonbing embeddingproblemformarkovmodelsofnucleotidesubstitution
AT pahwaanuj embeddingproblemformarkovmodelsofnucleotidesubstitution
AT shaoyunli embeddingproblemformarkovmodelsofnucleotidesubstitution
AT huttleygavina embeddingproblemformarkovmodelsofnucleotidesubstitution