Cargando…

Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution

Phylogenetic comparative methods are increasingly used to test hypotheses about the evolutionary processes that drive divergence in gene expression among species. However, it is unknown whether the distributional assumptions of phylogenetic models designed for quantitative phenotypic traits are real...

Descripción completa

Detalles Bibliográficos
Autores principales: Dimayacyac, Jose Rafael, Wu, Shanyun, Jiang, Daohan, Pennell, Matt
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10461906/
https://www.ncbi.nlm.nih.gov/pubmed/37645857
http://dx.doi.org/10.1101/2023.02.09.527893
_version_ 1785097934398816256
author Dimayacyac, Jose Rafael
Wu, Shanyun
Jiang, Daohan
Pennell, Matt
author_facet Dimayacyac, Jose Rafael
Wu, Shanyun
Jiang, Daohan
Pennell, Matt
author_sort Dimayacyac, Jose Rafael
collection PubMed
description Phylogenetic comparative methods are increasingly used to test hypotheses about the evolutionary processes that drive divergence in gene expression among species. However, it is unknown whether the distributional assumptions of phylogenetic models designed for quantitative phenotypic traits are realistic for expression data and importantly, the reliability of conclusions of phylogenetic comparative studies of gene expression may depend on whether the data is well-described by the chosen model. To evaluate this, we first fit several phylogenetic models of trait evolution to 8 previously published comparative expression datasets, comprising a total of 54,774 genes with 145,927 unique gene-tissue combinations. Using a previously developed approach, we then assessed how well the best model of the set described the data in an absolute (not just relative) sense. First, we find that Ornstein-Uhlenbeck models, in which expression values are constrained around an optimum, were the preferred model for 66% of gene-tissue combinations. Second, we find that for 61% of gene-tissue combinations, the best fit model of the set was found to perform well; the rest were found to be performing poorly by at least one of the test statistics we examined. Third, we find that when simple models do not perform well, this appears to be typically a consequence of failing to fully account for heterogeneity in the rate of the evolution. We advocate that assessment of model performance should become a routine component of phylogenetic comparative expression studies; doing so can improve the reliability of inferences and inspire the development of novel models.
format Online
Article
Text
id pubmed-10461906
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104619062023-08-29 Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution Dimayacyac, Jose Rafael Wu, Shanyun Jiang, Daohan Pennell, Matt bioRxiv Article Phylogenetic comparative methods are increasingly used to test hypotheses about the evolutionary processes that drive divergence in gene expression among species. However, it is unknown whether the distributional assumptions of phylogenetic models designed for quantitative phenotypic traits are realistic for expression data and importantly, the reliability of conclusions of phylogenetic comparative studies of gene expression may depend on whether the data is well-described by the chosen model. To evaluate this, we first fit several phylogenetic models of trait evolution to 8 previously published comparative expression datasets, comprising a total of 54,774 genes with 145,927 unique gene-tissue combinations. Using a previously developed approach, we then assessed how well the best model of the set described the data in an absolute (not just relative) sense. First, we find that Ornstein-Uhlenbeck models, in which expression values are constrained around an optimum, were the preferred model for 66% of gene-tissue combinations. Second, we find that for 61% of gene-tissue combinations, the best fit model of the set was found to perform well; the rest were found to be performing poorly by at least one of the test statistics we examined. Third, we find that when simple models do not perform well, this appears to be typically a consequence of failing to fully account for heterogeneity in the rate of the evolution. We advocate that assessment of model performance should become a routine component of phylogenetic comparative expression studies; doing so can improve the reliability of inferences and inspire the development of novel models. Cold Spring Harbor Laboratory 2023-08-17 /pmc/articles/PMC10461906/ /pubmed/37645857 http://dx.doi.org/10.1101/2023.02.09.527893 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Dimayacyac, Jose Rafael
Wu, Shanyun
Jiang, Daohan
Pennell, Matt
Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution
title Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution
title_full Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution
title_fullStr Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution
title_full_unstemmed Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution
title_short Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution
title_sort evaluating the performance of widely used phylogenetic models for gene expression evolution
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10461906/
https://www.ncbi.nlm.nih.gov/pubmed/37645857
http://dx.doi.org/10.1101/2023.02.09.527893
work_keys_str_mv AT dimayacyacjoserafael evaluatingtheperformanceofwidelyusedphylogeneticmodelsforgeneexpressionevolution
AT wushanyun evaluatingtheperformanceofwidelyusedphylogeneticmodelsforgeneexpressionevolution
AT jiangdaohan evaluatingtheperformanceofwidelyusedphylogeneticmodelsforgeneexpressionevolution
AT pennellmatt evaluatingtheperformanceofwidelyusedphylogeneticmodelsforgeneexpressionevolution