Cargando…
Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies
MOTIVATION: Precise time calibrations needed to estimate ages of species divergence are not always available due to fossil records' incompleteness. Consequently, clock calibrations available for Bayesian dating analyses can be few and diffused, i.e. phylogenies are calibration-poor, impeding re...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275332/ https://www.ncbi.nlm.nih.gov/pubmed/34252953 http://dx.doi.org/10.1093/bioinformatics/btab307 |
_version_ | 1783721691711537152 |
---|---|
author | Tao, Qiqing Barba-Montoya, Jose Kumar, Sudhir |
author_facet | Tao, Qiqing Barba-Montoya, Jose Kumar, Sudhir |
author_sort | Tao, Qiqing |
collection | PubMed |
description | MOTIVATION: Precise time calibrations needed to estimate ages of species divergence are not always available due to fossil records' incompleteness. Consequently, clock calibrations available for Bayesian dating analyses can be few and diffused, i.e. phylogenies are calibration-poor, impeding reliable inference of the timetree of life. We examined the role of speciation birth–death (BD) tree prior on Bayesian node age estimates in calibration-poor phylogenies and tested the usefulness of an informative, data-driven tree prior to enhancing the accuracy and precision of estimated times. RESULTS: We present a simple method to estimate parameters of the BD tree prior from the molecular phylogeny for use in Bayesian dating analyses. The use of a data-driven birth–death (ddBD) tree prior leads to improvement in Bayesian node age estimates for calibration-poor phylogenies. We show that the ddBD tree prior, along with only a few well-constrained calibrations, can produce excellent node ages and credibility intervals, whereas the use of an uninformative, uniform (flat) tree prior may require more calibrations. Relaxed clock dating with ddBD tree prior also produced better results than a flat tree prior when using diffused node calibrations. We also suggest using ddBD tree priors to improve the detection of outliers and influential calibrations in cross-validation analyses. These results have practical applications because the ddBD tree prior reduces the number of well-constrained calibrations necessary to obtain reliable node age estimates. This would help address key impediments in building the grand timetree of life, revealing the process of speciation and elucidating the dynamics of biological diversification. AVAILABILITY AND IMPLEMENTATION: An R module for computing the ddBD tree prior, simulated datasets and empirical datasets are available at https://github.com/cathyqqtao/ddBD-tree-prior. |
format | Online Article Text |
id | pubmed-8275332 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-82753322021-07-13 Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies Tao, Qiqing Barba-Montoya, Jose Kumar, Sudhir Bioinformatics Evolutionary, Comparative and Population Genomics MOTIVATION: Precise time calibrations needed to estimate ages of species divergence are not always available due to fossil records' incompleteness. Consequently, clock calibrations available for Bayesian dating analyses can be few and diffused, i.e. phylogenies are calibration-poor, impeding reliable inference of the timetree of life. We examined the role of speciation birth–death (BD) tree prior on Bayesian node age estimates in calibration-poor phylogenies and tested the usefulness of an informative, data-driven tree prior to enhancing the accuracy and precision of estimated times. RESULTS: We present a simple method to estimate parameters of the BD tree prior from the molecular phylogeny for use in Bayesian dating analyses. The use of a data-driven birth–death (ddBD) tree prior leads to improvement in Bayesian node age estimates for calibration-poor phylogenies. We show that the ddBD tree prior, along with only a few well-constrained calibrations, can produce excellent node ages and credibility intervals, whereas the use of an uninformative, uniform (flat) tree prior may require more calibrations. Relaxed clock dating with ddBD tree prior also produced better results than a flat tree prior when using diffused node calibrations. We also suggest using ddBD tree priors to improve the detection of outliers and influential calibrations in cross-validation analyses. These results have practical applications because the ddBD tree prior reduces the number of well-constrained calibrations necessary to obtain reliable node age estimates. This would help address key impediments in building the grand timetree of life, revealing the process of speciation and elucidating the dynamics of biological diversification. AVAILABILITY AND IMPLEMENTATION: An R module for computing the ddBD tree prior, simulated datasets and empirical datasets are available at https://github.com/cathyqqtao/ddBD-tree-prior. Oxford University Press 2021-07-12 /pmc/articles/PMC8275332/ /pubmed/34252953 http://dx.doi.org/10.1093/bioinformatics/btab307 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Evolutionary, Comparative and Population Genomics Tao, Qiqing Barba-Montoya, Jose Kumar, Sudhir Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies |
title | Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies |
title_full | Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies |
title_fullStr | Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies |
title_full_unstemmed | Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies |
title_short | Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies |
title_sort | data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies |
topic | Evolutionary, Comparative and Population Genomics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275332/ https://www.ncbi.nlm.nih.gov/pubmed/34252953 http://dx.doi.org/10.1093/bioinformatics/btab307 |
work_keys_str_mv | AT taoqiqing datadrivenspeciationtreepriorforbetterspeciesdivergencetimesincalibrationpoormolecularphylogenies AT barbamontoyajose datadrivenspeciationtreepriorforbetterspeciesdivergencetimesincalibrationpoormolecularphylogenies AT kumarsudhir datadrivenspeciationtreepriorforbetterspeciesdivergencetimesincalibrationpoormolecularphylogenies |