Cargando…
The space of phylogenetic mixtures for equivariant models
BACKGROUND: The selection of an evolutionary model to best fit given molecular data is usually a heuristic choice. In his seminal book, J. Felsenstein suggested that certain linear equations satisfied by the expected probabilities of patterns observed at the leaves of a phylogenetic tree could be us...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608327/ https://www.ncbi.nlm.nih.gov/pubmed/23190710 http://dx.doi.org/10.1186/1748-7188-7-33 |
_version_ | 1782264224316129280 |
---|---|
author | Casanellas, Marta Fernández-Sánchez, Jesús Kedzierska, Anna M |
author_facet | Casanellas, Marta Fernández-Sánchez, Jesús Kedzierska, Anna M |
author_sort | Casanellas, Marta |
collection | PubMed |
description | BACKGROUND: The selection of an evolutionary model to best fit given molecular data is usually a heuristic choice. In his seminal book, J. Felsenstein suggested that certain linear equations satisfied by the expected probabilities of patterns observed at the leaves of a phylogenetic tree could be used for model selection. It remained an open question, however, whether these equations were sufficient to fully characterize the evolutionary model under consideration. RESULTS: Here we prove that, for most equivariant models of evolution, the space of distributions satisfying these linear equations coincides with the space of distributions arising from mixtures of trees. In other words, we prove that the evolution of an observed multiple sequence alignment can be modeled by a mixture of phylogenetic trees under an equivariant evolutionary model if and only if the distribution of patterns at its columns satisfies the linear equations mentioned above. Moreover, we provide a set of linearly independent equations defining this space of phylogenetic mixtures for each equivariant model and for any number of taxa. Lastly, we use these results to perform a study of identifiability of phylogenetic mixtures. CONCLUSIONS: The space of phylogenetic mixtures under equivariant models is a linear space that fully characterizes the evolutionary model. We provide an explicit algorithm to obtain the equations defining these spaces for a number of models and taxa. Its implementation has proved to be a powerful tool for model selection. |
format | Online Article Text |
id | pubmed-3608327 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-36083272013-03-29 The space of phylogenetic mixtures for equivariant models Casanellas, Marta Fernández-Sánchez, Jesús Kedzierska, Anna M Algorithms Mol Biol Research BACKGROUND: The selection of an evolutionary model to best fit given molecular data is usually a heuristic choice. In his seminal book, J. Felsenstein suggested that certain linear equations satisfied by the expected probabilities of patterns observed at the leaves of a phylogenetic tree could be used for model selection. It remained an open question, however, whether these equations were sufficient to fully characterize the evolutionary model under consideration. RESULTS: Here we prove that, for most equivariant models of evolution, the space of distributions satisfying these linear equations coincides with the space of distributions arising from mixtures of trees. In other words, we prove that the evolution of an observed multiple sequence alignment can be modeled by a mixture of phylogenetic trees under an equivariant evolutionary model if and only if the distribution of patterns at its columns satisfies the linear equations mentioned above. Moreover, we provide a set of linearly independent equations defining this space of phylogenetic mixtures for each equivariant model and for any number of taxa. Lastly, we use these results to perform a study of identifiability of phylogenetic mixtures. CONCLUSIONS: The space of phylogenetic mixtures under equivariant models is a linear space that fully characterizes the evolutionary model. We provide an explicit algorithm to obtain the equations defining these spaces for a number of models and taxa. Its implementation has proved to be a powerful tool for model selection. BioMed Central 2012-11-28 /pmc/articles/PMC3608327/ /pubmed/23190710 http://dx.doi.org/10.1186/1748-7188-7-33 Text en Copyright ©2012 Casanellas et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Casanellas, Marta Fernández-Sánchez, Jesús Kedzierska, Anna M The space of phylogenetic mixtures for equivariant models |
title | The space of phylogenetic mixtures for equivariant models |
title_full | The space of phylogenetic mixtures for equivariant models |
title_fullStr | The space of phylogenetic mixtures for equivariant models |
title_full_unstemmed | The space of phylogenetic mixtures for equivariant models |
title_short | The space of phylogenetic mixtures for equivariant models |
title_sort | space of phylogenetic mixtures for equivariant models |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608327/ https://www.ncbi.nlm.nih.gov/pubmed/23190710 http://dx.doi.org/10.1186/1748-7188-7-33 |
work_keys_str_mv | AT casanellasmarta thespaceofphylogeneticmixturesforequivariantmodels AT fernandezsanchezjesus thespaceofphylogeneticmixturesforequivariantmodels AT kedzierskaannam thespaceofphylogeneticmixturesforequivariantmodels AT casanellasmarta spaceofphylogeneticmixturesforequivariantmodels AT fernandezsanchezjesus spaceofphylogeneticmixturesforequivariantmodels AT kedzierskaannam spaceofphylogeneticmixturesforequivariantmodels |