Cargando…

A New Hierarchy of Phylogenetic Models Consistent with Heterogeneous Substitution Rates

When the process underlying DNA substitutions varies across evolutionary history, some standard Markov models underlying phylogenetic methods are mathematically inconsistent. The most prominent example is the general time-reversible model (GTR) together with some, but not all, of its submodels. To r...

Descripción completa

Detalles Bibliográficos
Autores principales: Woodhams, Michael D., Fernández-Sánchez, Jesús, Sumner, Jeremy G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4468350/
https://www.ncbi.nlm.nih.gov/pubmed/25858352
http://dx.doi.org/10.1093/sysbio/syv021
_version_ 1782376498493128704
author Woodhams, Michael D.
Fernández-Sánchez, Jesús
Sumner, Jeremy G.
author_facet Woodhams, Michael D.
Fernández-Sánchez, Jesús
Sumner, Jeremy G.
author_sort Woodhams, Michael D.
collection PubMed
description When the process underlying DNA substitutions varies across evolutionary history, some standard Markov models underlying phylogenetic methods are mathematically inconsistent. The most prominent example is the general time-reversible model (GTR) together with some, but not all, of its submodels. To rectify this deficiency, nonhomogeneous Lie Markov models have been identified as the class of models that are consistent in the face of a changing process of DNA substitutions regardless of taxon sampling. Some well-known models in popular use are within this class, but are either overly simplistic (e.g., the Kimura two-parameter model) or overly complex (the general Markov model). On a diverse set of biological data sets, we test a hierarchy of Lie Markov models spanning the full range of parameter richness. Compared against the benchmark of the ever-popular GTR model, we find that as a whole the Lie Markov models perform well, with the best performing models having 8–10 parameters and the ability to recognize the distinction between purines and pyrimidines.
format Online
Article
Text
id pubmed-4468350
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44683502015-06-17 A New Hierarchy of Phylogenetic Models Consistent with Heterogeneous Substitution Rates Woodhams, Michael D. Fernández-Sánchez, Jesús Sumner, Jeremy G. Syst Biol Regular Articles When the process underlying DNA substitutions varies across evolutionary history, some standard Markov models underlying phylogenetic methods are mathematically inconsistent. The most prominent example is the general time-reversible model (GTR) together with some, but not all, of its submodels. To rectify this deficiency, nonhomogeneous Lie Markov models have been identified as the class of models that are consistent in the face of a changing process of DNA substitutions regardless of taxon sampling. Some well-known models in popular use are within this class, but are either overly simplistic (e.g., the Kimura two-parameter model) or overly complex (the general Markov model). On a diverse set of biological data sets, we test a hierarchy of Lie Markov models spanning the full range of parameter richness. Compared against the benchmark of the ever-popular GTR model, we find that as a whole the Lie Markov models perform well, with the best performing models having 8–10 parameters and the ability to recognize the distinction between purines and pyrimidines. Oxford University Press 2015-07 2015-04-08 /pmc/articles/PMC4468350/ /pubmed/25858352 http://dx.doi.org/10.1093/sysbio/syv021 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regular Articles
Woodhams, Michael D.
Fernández-Sánchez, Jesús
Sumner, Jeremy G.
A New Hierarchy of Phylogenetic Models Consistent with Heterogeneous Substitution Rates
title A New Hierarchy of Phylogenetic Models Consistent with Heterogeneous Substitution Rates
title_full A New Hierarchy of Phylogenetic Models Consistent with Heterogeneous Substitution Rates
title_fullStr A New Hierarchy of Phylogenetic Models Consistent with Heterogeneous Substitution Rates
title_full_unstemmed A New Hierarchy of Phylogenetic Models Consistent with Heterogeneous Substitution Rates
title_short A New Hierarchy of Phylogenetic Models Consistent with Heterogeneous Substitution Rates
title_sort new hierarchy of phylogenetic models consistent with heterogeneous substitution rates
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4468350/
https://www.ncbi.nlm.nih.gov/pubmed/25858352
http://dx.doi.org/10.1093/sysbio/syv021
work_keys_str_mv AT woodhamsmichaeld anewhierarchyofphylogeneticmodelsconsistentwithheterogeneoussubstitutionrates
AT fernandezsanchezjesus anewhierarchyofphylogeneticmodelsconsistentwithheterogeneoussubstitutionrates
AT sumnerjeremyg anewhierarchyofphylogeneticmodelsconsistentwithheterogeneoussubstitutionrates
AT woodhamsmichaeld newhierarchyofphylogeneticmodelsconsistentwithheterogeneoussubstitutionrates
AT fernandezsanchezjesus newhierarchyofphylogeneticmodelsconsistentwithheterogeneoussubstitutionrates
AT sumnerjeremyg newhierarchyofphylogeneticmodelsconsistentwithheterogeneoussubstitutionrates