Cargando…
Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution
Phylogenetic comparative methods are increasingly used to test hypotheses about the evolutionary processes that drive divergence in gene expression among species. However, it is unknown whether the distributional assumptions of phylogenetic models designed for quantitative phenotypic traits are real...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10461906/ https://www.ncbi.nlm.nih.gov/pubmed/37645857 http://dx.doi.org/10.1101/2023.02.09.527893 |
_version_ | 1785097934398816256 |
---|---|
author | Dimayacyac, Jose Rafael Wu, Shanyun Jiang, Daohan Pennell, Matt |
author_facet | Dimayacyac, Jose Rafael Wu, Shanyun Jiang, Daohan Pennell, Matt |
author_sort | Dimayacyac, Jose Rafael |
collection | PubMed |
description | Phylogenetic comparative methods are increasingly used to test hypotheses about the evolutionary processes that drive divergence in gene expression among species. However, it is unknown whether the distributional assumptions of phylogenetic models designed for quantitative phenotypic traits are realistic for expression data and importantly, the reliability of conclusions of phylogenetic comparative studies of gene expression may depend on whether the data is well-described by the chosen model. To evaluate this, we first fit several phylogenetic models of trait evolution to 8 previously published comparative expression datasets, comprising a total of 54,774 genes with 145,927 unique gene-tissue combinations. Using a previously developed approach, we then assessed how well the best model of the set described the data in an absolute (not just relative) sense. First, we find that Ornstein-Uhlenbeck models, in which expression values are constrained around an optimum, were the preferred model for 66% of gene-tissue combinations. Second, we find that for 61% of gene-tissue combinations, the best fit model of the set was found to perform well; the rest were found to be performing poorly by at least one of the test statistics we examined. Third, we find that when simple models do not perform well, this appears to be typically a consequence of failing to fully account for heterogeneity in the rate of the evolution. We advocate that assessment of model performance should become a routine component of phylogenetic comparative expression studies; doing so can improve the reliability of inferences and inspire the development of novel models. |
format | Online Article Text |
id | pubmed-10461906 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-104619062023-08-29 Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution Dimayacyac, Jose Rafael Wu, Shanyun Jiang, Daohan Pennell, Matt bioRxiv Article Phylogenetic comparative methods are increasingly used to test hypotheses about the evolutionary processes that drive divergence in gene expression among species. However, it is unknown whether the distributional assumptions of phylogenetic models designed for quantitative phenotypic traits are realistic for expression data and importantly, the reliability of conclusions of phylogenetic comparative studies of gene expression may depend on whether the data is well-described by the chosen model. To evaluate this, we first fit several phylogenetic models of trait evolution to 8 previously published comparative expression datasets, comprising a total of 54,774 genes with 145,927 unique gene-tissue combinations. Using a previously developed approach, we then assessed how well the best model of the set described the data in an absolute (not just relative) sense. First, we find that Ornstein-Uhlenbeck models, in which expression values are constrained around an optimum, were the preferred model for 66% of gene-tissue combinations. Second, we find that for 61% of gene-tissue combinations, the best fit model of the set was found to perform well; the rest were found to be performing poorly by at least one of the test statistics we examined. Third, we find that when simple models do not perform well, this appears to be typically a consequence of failing to fully account for heterogeneity in the rate of the evolution. We advocate that assessment of model performance should become a routine component of phylogenetic comparative expression studies; doing so can improve the reliability of inferences and inspire the development of novel models. Cold Spring Harbor Laboratory 2023-08-17 /pmc/articles/PMC10461906/ /pubmed/37645857 http://dx.doi.org/10.1101/2023.02.09.527893 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Dimayacyac, Jose Rafael Wu, Shanyun Jiang, Daohan Pennell, Matt Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution |
title | Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution |
title_full | Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution |
title_fullStr | Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution |
title_full_unstemmed | Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution |
title_short | Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution |
title_sort | evaluating the performance of widely used phylogenetic models for gene expression evolution |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10461906/ https://www.ncbi.nlm.nih.gov/pubmed/37645857 http://dx.doi.org/10.1101/2023.02.09.527893 |
work_keys_str_mv | AT dimayacyacjoserafael evaluatingtheperformanceofwidelyusedphylogeneticmodelsforgeneexpressionevolution AT wushanyun evaluatingtheperformanceofwidelyusedphylogeneticmodelsforgeneexpressionevolution AT jiangdaohan evaluatingtheperformanceofwidelyusedphylogeneticmodelsforgeneexpressionevolution AT pennellmatt evaluatingtheperformanceofwidelyusedphylogeneticmodelsforgeneexpressionevolution |