Cargando…

Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies

MOTIVATION: Precise time calibrations needed to estimate ages of species divergence are not always available due to fossil records' incompleteness. Consequently, clock calibrations available for Bayesian dating analyses can be few and diffused, i.e. phylogenies are calibration-poor, impeding re...

Descripción completa

Detalles Bibliográficos
Autores principales: Tao, Qiqing, Barba-Montoya, Jose, Kumar, Sudhir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275332/
https://www.ncbi.nlm.nih.gov/pubmed/34252953
http://dx.doi.org/10.1093/bioinformatics/btab307
_version_ 1783721691711537152
author Tao, Qiqing
Barba-Montoya, Jose
Kumar, Sudhir
author_facet Tao, Qiqing
Barba-Montoya, Jose
Kumar, Sudhir
author_sort Tao, Qiqing
collection PubMed
description MOTIVATION: Precise time calibrations needed to estimate ages of species divergence are not always available due to fossil records' incompleteness. Consequently, clock calibrations available for Bayesian dating analyses can be few and diffused, i.e. phylogenies are calibration-poor, impeding reliable inference of the timetree of life. We examined the role of speciation birth–death (BD) tree prior on Bayesian node age estimates in calibration-poor phylogenies and tested the usefulness of an informative, data-driven tree prior to enhancing the accuracy and precision of estimated times. RESULTS: We present a simple method to estimate parameters of the BD tree prior from the molecular phylogeny for use in Bayesian dating analyses. The use of a data-driven birth–death (ddBD) tree prior leads to improvement in Bayesian node age estimates for calibration-poor phylogenies. We show that the ddBD tree prior, along with only a few well-constrained calibrations, can produce excellent node ages and credibility intervals, whereas the use of an uninformative, uniform (flat) tree prior may require more calibrations. Relaxed clock dating with ddBD tree prior also produced better results than a flat tree prior when using diffused node calibrations. We also suggest using ddBD tree priors to improve the detection of outliers and influential calibrations in cross-validation analyses. These results have practical applications because the ddBD tree prior reduces the number of well-constrained calibrations necessary to obtain reliable node age estimates. This would help address key impediments in building the grand timetree of life, revealing the process of speciation and elucidating the dynamics of biological diversification. AVAILABILITY AND IMPLEMENTATION: An R module for computing the ddBD tree prior, simulated datasets and empirical datasets are available at https://github.com/cathyqqtao/ddBD-tree-prior.
format Online
Article
Text
id pubmed-8275332
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-82753322021-07-13 Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies Tao, Qiqing Barba-Montoya, Jose Kumar, Sudhir Bioinformatics Evolutionary, Comparative and Population Genomics MOTIVATION: Precise time calibrations needed to estimate ages of species divergence are not always available due to fossil records' incompleteness. Consequently, clock calibrations available for Bayesian dating analyses can be few and diffused, i.e. phylogenies are calibration-poor, impeding reliable inference of the timetree of life. We examined the role of speciation birth–death (BD) tree prior on Bayesian node age estimates in calibration-poor phylogenies and tested the usefulness of an informative, data-driven tree prior to enhancing the accuracy and precision of estimated times. RESULTS: We present a simple method to estimate parameters of the BD tree prior from the molecular phylogeny for use in Bayesian dating analyses. The use of a data-driven birth–death (ddBD) tree prior leads to improvement in Bayesian node age estimates for calibration-poor phylogenies. We show that the ddBD tree prior, along with only a few well-constrained calibrations, can produce excellent node ages and credibility intervals, whereas the use of an uninformative, uniform (flat) tree prior may require more calibrations. Relaxed clock dating with ddBD tree prior also produced better results than a flat tree prior when using diffused node calibrations. We also suggest using ddBD tree priors to improve the detection of outliers and influential calibrations in cross-validation analyses. These results have practical applications because the ddBD tree prior reduces the number of well-constrained calibrations necessary to obtain reliable node age estimates. This would help address key impediments in building the grand timetree of life, revealing the process of speciation and elucidating the dynamics of biological diversification. AVAILABILITY AND IMPLEMENTATION: An R module for computing the ddBD tree prior, simulated datasets and empirical datasets are available at https://github.com/cathyqqtao/ddBD-tree-prior. Oxford University Press 2021-07-12 /pmc/articles/PMC8275332/ /pubmed/34252953 http://dx.doi.org/10.1093/bioinformatics/btab307 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Evolutionary, Comparative and Population Genomics
Tao, Qiqing
Barba-Montoya, Jose
Kumar, Sudhir
Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies
title Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies
title_full Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies
title_fullStr Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies
title_full_unstemmed Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies
title_short Data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies
title_sort data-driven speciation tree prior for better species divergence times in calibration-poor molecular phylogenies
topic Evolutionary, Comparative and Population Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275332/
https://www.ncbi.nlm.nih.gov/pubmed/34252953
http://dx.doi.org/10.1093/bioinformatics/btab307
work_keys_str_mv AT taoqiqing datadrivenspeciationtreepriorforbetterspeciesdivergencetimesincalibrationpoormolecularphylogenies
AT barbamontoyajose datadrivenspeciationtreepriorforbetterspeciesdivergencetimesincalibrationpoormolecularphylogenies
AT kumarsudhir datadrivenspeciationtreepriorforbetterspeciesdivergencetimesincalibrationpoormolecularphylogenies