Cargando…

The space of phylogenetic mixtures for equivariant models

BACKGROUND: The selection of an evolutionary model to best fit given molecular data is usually a heuristic choice. In his seminal book, J. Felsenstein suggested that certain linear equations satisfied by the expected probabilities of patterns observed at the leaves of a phylogenetic tree could be us...

Descripción completa

Detalles Bibliográficos
Autores principales: Casanellas, Marta, Fernández-Sánchez, Jesús, Kedzierska, Anna M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608327/
https://www.ncbi.nlm.nih.gov/pubmed/23190710
http://dx.doi.org/10.1186/1748-7188-7-33
_version_ 1782264224316129280
author Casanellas, Marta
Fernández-Sánchez, Jesús
Kedzierska, Anna M
author_facet Casanellas, Marta
Fernández-Sánchez, Jesús
Kedzierska, Anna M
author_sort Casanellas, Marta
collection PubMed
description BACKGROUND: The selection of an evolutionary model to best fit given molecular data is usually a heuristic choice. In his seminal book, J. Felsenstein suggested that certain linear equations satisfied by the expected probabilities of patterns observed at the leaves of a phylogenetic tree could be used for model selection. It remained an open question, however, whether these equations were sufficient to fully characterize the evolutionary model under consideration. RESULTS: Here we prove that, for most equivariant models of evolution, the space of distributions satisfying these linear equations coincides with the space of distributions arising from mixtures of trees. In other words, we prove that the evolution of an observed multiple sequence alignment can be modeled by a mixture of phylogenetic trees under an equivariant evolutionary model if and only if the distribution of patterns at its columns satisfies the linear equations mentioned above. Moreover, we provide a set of linearly independent equations defining this space of phylogenetic mixtures for each equivariant model and for any number of taxa. Lastly, we use these results to perform a study of identifiability of phylogenetic mixtures. CONCLUSIONS: The space of phylogenetic mixtures under equivariant models is a linear space that fully characterizes the evolutionary model. We provide an explicit algorithm to obtain the equations defining these spaces for a number of models and taxa. Its implementation has proved to be a powerful tool for model selection.
format Online
Article
Text
id pubmed-3608327
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36083272013-03-29 The space of phylogenetic mixtures for equivariant models Casanellas, Marta Fernández-Sánchez, Jesús Kedzierska, Anna M Algorithms Mol Biol Research BACKGROUND: The selection of an evolutionary model to best fit given molecular data is usually a heuristic choice. In his seminal book, J. Felsenstein suggested that certain linear equations satisfied by the expected probabilities of patterns observed at the leaves of a phylogenetic tree could be used for model selection. It remained an open question, however, whether these equations were sufficient to fully characterize the evolutionary model under consideration. RESULTS: Here we prove that, for most equivariant models of evolution, the space of distributions satisfying these linear equations coincides with the space of distributions arising from mixtures of trees. In other words, we prove that the evolution of an observed multiple sequence alignment can be modeled by a mixture of phylogenetic trees under an equivariant evolutionary model if and only if the distribution of patterns at its columns satisfies the linear equations mentioned above. Moreover, we provide a set of linearly independent equations defining this space of phylogenetic mixtures for each equivariant model and for any number of taxa. Lastly, we use these results to perform a study of identifiability of phylogenetic mixtures. CONCLUSIONS: The space of phylogenetic mixtures under equivariant models is a linear space that fully characterizes the evolutionary model. We provide an explicit algorithm to obtain the equations defining these spaces for a number of models and taxa. Its implementation has proved to be a powerful tool for model selection. BioMed Central 2012-11-28 /pmc/articles/PMC3608327/ /pubmed/23190710 http://dx.doi.org/10.1186/1748-7188-7-33 Text en Copyright ©2012 Casanellas et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Casanellas, Marta
Fernández-Sánchez, Jesús
Kedzierska, Anna M
The space of phylogenetic mixtures for equivariant models
title The space of phylogenetic mixtures for equivariant models
title_full The space of phylogenetic mixtures for equivariant models
title_fullStr The space of phylogenetic mixtures for equivariant models
title_full_unstemmed The space of phylogenetic mixtures for equivariant models
title_short The space of phylogenetic mixtures for equivariant models
title_sort space of phylogenetic mixtures for equivariant models
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608327/
https://www.ncbi.nlm.nih.gov/pubmed/23190710
http://dx.doi.org/10.1186/1748-7188-7-33
work_keys_str_mv AT casanellasmarta thespaceofphylogeneticmixturesforequivariantmodels
AT fernandezsanchezjesus thespaceofphylogeneticmixturesforequivariantmodels
AT kedzierskaannam thespaceofphylogeneticmixturesforequivariantmodels
AT casanellasmarta spaceofphylogeneticmixturesforequivariantmodels
AT fernandezsanchezjesus spaceofphylogeneticmixturesforequivariantmodels
AT kedzierskaannam spaceofphylogeneticmixturesforequivariantmodels