Cargando…

Impact of Phylogenetic Tree Completeness and Mis-specification of Sampling Fractions on Trait Dependent Diversification Models

Understanding the origins of diversity and the factors that drive some clades to be more diverse than others are important issues in evolutionary biology. Sophisticated SSE (state-dependent speciation and extinction) models provide insights into the association between diversification rates and the...

Descripción completa

Detalles Bibliográficos
Autores principales: Mynard, Poppy, Algar, Adam C, Lancaster, Lesley T, Bocedi, Greta, Fahri, Fahri, Gubry-Rangin, Cécile, Lupiyaningdyah, Pungki, Nangoy, Meis, Osborne, Owen G, Papadopulos, Alexander S T, Sudiana, I Made, Juliandi, Berry, Travis, Justin M J, Herrera-Alsina, Leonel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198648/
https://www.ncbi.nlm.nih.gov/pubmed/36645380
http://dx.doi.org/10.1093/sysbio/syad001
_version_ 1785044778055892992
author Mynard, Poppy
Algar, Adam C
Lancaster, Lesley T
Bocedi, Greta
Fahri, Fahri
Gubry-Rangin, Cécile
Lupiyaningdyah, Pungki
Nangoy, Meis
Osborne, Owen G
Papadopulos, Alexander S T
Sudiana, I Made
Juliandi, Berry
Travis, Justin M J
Herrera-Alsina, Leonel
author_facet Mynard, Poppy
Algar, Adam C
Lancaster, Lesley T
Bocedi, Greta
Fahri, Fahri
Gubry-Rangin, Cécile
Lupiyaningdyah, Pungki
Nangoy, Meis
Osborne, Owen G
Papadopulos, Alexander S T
Sudiana, I Made
Juliandi, Berry
Travis, Justin M J
Herrera-Alsina, Leonel
author_sort Mynard, Poppy
collection PubMed
description Understanding the origins of diversity and the factors that drive some clades to be more diverse than others are important issues in evolutionary biology. Sophisticated SSE (state-dependent speciation and extinction) models provide insights into the association between diversification rates and the evolution of a trait. The empirical data used in SSE models and other methods is normally imperfect, yet little is known about how this can affect these models. Here, we evaluate the impact of common phylogenetic issues on inferences drawn from SSE models. Using simulated phylogenetic trees and trait information, we fitted SSE models to determine the effects of sampling fraction (phylogenetic tree completeness) and sampling fraction mis-specification on model selection and parameter estimation (speciation, extinction, and transition rates) under two sampling regimes (random and taxonomically biased). As expected, we found that both model selection and parameter estimate accuracies are reduced at lower sampling fractions (i.e., low tree completeness). Furthermore, when sampling of the tree is imbalanced across sub-clades and tree completeness is ≤ 60%, rates of false positives increase and parameter estimates are less accurate, compared to when sampling is random. Thus, when applying SSE methods to empirical datasets, there are increased risks of false inferences of trait dependent diversification when some sub-clades are heavily under-sampled. Mis-specifying the sampling fraction severely affected the accuracy of parameter estimates: parameter values were over-estimated when the sampling fraction was specified as lower than its true value, and under-estimated when the sampling fraction was specified as higher than its true value. Our results suggest that it is better to cautiously under-estimate sampling efforts, as false positives increased when the sampling fraction was over-estimated. We encourage SSE studies where the sampling fraction can be reasonably estimated and provide recommended best practices for SSE modeling. [Trait dependent diversification; SSE models; phylogenetic tree completeness; sampling fraction.]
format Online
Article
Text
id pubmed-10198648
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101986482023-05-20 Impact of Phylogenetic Tree Completeness and Mis-specification of Sampling Fractions on Trait Dependent Diversification Models Mynard, Poppy Algar, Adam C Lancaster, Lesley T Bocedi, Greta Fahri, Fahri Gubry-Rangin, Cécile Lupiyaningdyah, Pungki Nangoy, Meis Osborne, Owen G Papadopulos, Alexander S T Sudiana, I Made Juliandi, Berry Travis, Justin M J Herrera-Alsina, Leonel Syst Biol Spotlight Articles Understanding the origins of diversity and the factors that drive some clades to be more diverse than others are important issues in evolutionary biology. Sophisticated SSE (state-dependent speciation and extinction) models provide insights into the association between diversification rates and the evolution of a trait. The empirical data used in SSE models and other methods is normally imperfect, yet little is known about how this can affect these models. Here, we evaluate the impact of common phylogenetic issues on inferences drawn from SSE models. Using simulated phylogenetic trees and trait information, we fitted SSE models to determine the effects of sampling fraction (phylogenetic tree completeness) and sampling fraction mis-specification on model selection and parameter estimation (speciation, extinction, and transition rates) under two sampling regimes (random and taxonomically biased). As expected, we found that both model selection and parameter estimate accuracies are reduced at lower sampling fractions (i.e., low tree completeness). Furthermore, when sampling of the tree is imbalanced across sub-clades and tree completeness is ≤ 60%, rates of false positives increase and parameter estimates are less accurate, compared to when sampling is random. Thus, when applying SSE methods to empirical datasets, there are increased risks of false inferences of trait dependent diversification when some sub-clades are heavily under-sampled. Mis-specifying the sampling fraction severely affected the accuracy of parameter estimates: parameter values were over-estimated when the sampling fraction was specified as lower than its true value, and under-estimated when the sampling fraction was specified as higher than its true value. Our results suggest that it is better to cautiously under-estimate sampling efforts, as false positives increased when the sampling fraction was over-estimated. We encourage SSE studies where the sampling fraction can be reasonably estimated and provide recommended best practices for SSE modeling. [Trait dependent diversification; SSE models; phylogenetic tree completeness; sampling fraction.] Oxford University Press 2023-01-16 /pmc/articles/PMC10198648/ /pubmed/36645380 http://dx.doi.org/10.1093/sysbio/syad001 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the Society of Systematic Biologists. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Spotlight Articles
Mynard, Poppy
Algar, Adam C
Lancaster, Lesley T
Bocedi, Greta
Fahri, Fahri
Gubry-Rangin, Cécile
Lupiyaningdyah, Pungki
Nangoy, Meis
Osborne, Owen G
Papadopulos, Alexander S T
Sudiana, I Made
Juliandi, Berry
Travis, Justin M J
Herrera-Alsina, Leonel
Impact of Phylogenetic Tree Completeness and Mis-specification of Sampling Fractions on Trait Dependent Diversification Models
title Impact of Phylogenetic Tree Completeness and Mis-specification of Sampling Fractions on Trait Dependent Diversification Models
title_full Impact of Phylogenetic Tree Completeness and Mis-specification of Sampling Fractions on Trait Dependent Diversification Models
title_fullStr Impact of Phylogenetic Tree Completeness and Mis-specification of Sampling Fractions on Trait Dependent Diversification Models
title_full_unstemmed Impact of Phylogenetic Tree Completeness and Mis-specification of Sampling Fractions on Trait Dependent Diversification Models
title_short Impact of Phylogenetic Tree Completeness and Mis-specification of Sampling Fractions on Trait Dependent Diversification Models
title_sort impact of phylogenetic tree completeness and mis-specification of sampling fractions on trait dependent diversification models
topic Spotlight Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198648/
https://www.ncbi.nlm.nih.gov/pubmed/36645380
http://dx.doi.org/10.1093/sysbio/syad001
work_keys_str_mv AT mynardpoppy impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT algaradamc impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT lancasterlesleyt impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT bocedigreta impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT fahrifahri impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT gubryrangincecile impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT lupiyaningdyahpungki impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT nangoymeis impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT osborneoweng impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT papadopulosalexanderst impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT sudianaimade impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT juliandiberry impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT travisjustinmj impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels
AT herreraalsinaleonel impactofphylogenetictreecompletenessandmisspecificationofsamplingfractionsontraitdependentdiversificationmodels