Cargando…

Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference

BACKGROUND: Model violations constitute the major limitation in inferring accurate phylogenies. Characterizing properties of the data that are not being correctly handled by current models is therefore of prime importance. One of the properties of protein evolution is the variation of the relative r...

Descripción completa

Detalles Bibliográficos
Autores principales: Roure, Béatrice, Philippe, Hervé
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3034684/
https://www.ncbi.nlm.nih.gov/pubmed/21235782
http://dx.doi.org/10.1186/1471-2148-11-17
_version_ 1782197683831701504
author Roure, Béatrice
Philippe, Hervé
author_facet Roure, Béatrice
Philippe, Hervé
author_sort Roure, Béatrice
collection PubMed
description BACKGROUND: Model violations constitute the major limitation in inferring accurate phylogenies. Characterizing properties of the data that are not being correctly handled by current models is therefore of prime importance. One of the properties of protein evolution is the variation of the relative rate of substitutions across sites and over time, the latter is the phenomenon called heterotachy. Its effect on phylogenetic inference has recently obtained considerable attention, which led to the development of new models of sequence evolution. However, thus far focus has been on the quantitative heterogeneity of the evolutionary process, thereby overlooking more qualitative variations. RESULTS: We studied the importance of variation of the site-specific amino-acid substitution process over time and its possible impact on phylogenetic inference. We used the CAT model to define an infinite mixture of substitution processes characterized by equilibrium frequencies over the twenty amino acids, a useful proxy for qualitatively estimating the evolutionary process. Using two large datasets, we show that qualitative changes in site-specific substitution properties over time occurred significantly. To test whether this unaccounted qualitative variation can lead to an erroneous phylogenetic tree, we analyzed a concatenation of mitochondrial proteins in which Cnidaria and Porifera were erroneously grouped. The progressive removal of the sites with the most heterogeneous CAT profiles across clades led to the recovery of the monophyly of Eumetazoa (Cnidaria+Bilateria), suggesting that this heterogeneity can negatively influence phylogenetic inference. CONCLUSION: The time-heterogeneity of the amino-acid replacement process is therefore an important evolutionary aspect that should be incorporated in future models of sequence change.
format Text
id pubmed-3034684
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30346842011-02-17 Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference Roure, Béatrice Philippe, Hervé BMC Evol Biol Research Article BACKGROUND: Model violations constitute the major limitation in inferring accurate phylogenies. Characterizing properties of the data that are not being correctly handled by current models is therefore of prime importance. One of the properties of protein evolution is the variation of the relative rate of substitutions across sites and over time, the latter is the phenomenon called heterotachy. Its effect on phylogenetic inference has recently obtained considerable attention, which led to the development of new models of sequence evolution. However, thus far focus has been on the quantitative heterogeneity of the evolutionary process, thereby overlooking more qualitative variations. RESULTS: We studied the importance of variation of the site-specific amino-acid substitution process over time and its possible impact on phylogenetic inference. We used the CAT model to define an infinite mixture of substitution processes characterized by equilibrium frequencies over the twenty amino acids, a useful proxy for qualitatively estimating the evolutionary process. Using two large datasets, we show that qualitative changes in site-specific substitution properties over time occurred significantly. To test whether this unaccounted qualitative variation can lead to an erroneous phylogenetic tree, we analyzed a concatenation of mitochondrial proteins in which Cnidaria and Porifera were erroneously grouped. The progressive removal of the sites with the most heterogeneous CAT profiles across clades led to the recovery of the monophyly of Eumetazoa (Cnidaria+Bilateria), suggesting that this heterogeneity can negatively influence phylogenetic inference. CONCLUSION: The time-heterogeneity of the amino-acid replacement process is therefore an important evolutionary aspect that should be incorporated in future models of sequence change. BioMed Central 2011-01-14 /pmc/articles/PMC3034684/ /pubmed/21235782 http://dx.doi.org/10.1186/1471-2148-11-17 Text en Copyright ©2011 Roure and Philippe; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Roure, Béatrice
Philippe, Hervé
Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference
title Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference
title_full Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference
title_fullStr Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference
title_full_unstemmed Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference
title_short Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference
title_sort site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3034684/
https://www.ncbi.nlm.nih.gov/pubmed/21235782
http://dx.doi.org/10.1186/1471-2148-11-17
work_keys_str_mv AT rourebeatrice sitespecifictimeheterogeneityofthesubstitutionprocessanditsimpactonphylogeneticinference
AT philippeherve sitespecifictimeheterogeneityofthesubstitutionprocessanditsimpactonphylogeneticinference