Cargando…
The Impact of the Rate Prior on Bayesian Estimation of Divergence Times with Multiple Loci
Bayesian methods provide a powerful way to estimate species divergence times by combining information from molecular sequences with information from the fossil record. With the explosive increase of genomic data, divergence time estimation increasingly uses data of multiple loci (genes or site parti...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4055871/ https://www.ncbi.nlm.nih.gov/pubmed/24658316 http://dx.doi.org/10.1093/sysbio/syu020 |
_version_ | 1782320742609715200 |
---|---|
author | Dos Reis, Mario Zhu, Tianqi Yang, Ziheng |
author_facet | Dos Reis, Mario Zhu, Tianqi Yang, Ziheng |
author_sort | Dos Reis, Mario |
collection | PubMed |
description | Bayesian methods provide a powerful way to estimate species divergence times by combining information from molecular sequences with information from the fossil record. With the explosive increase of genomic data, divergence time estimation increasingly uses data of multiple loci (genes or site partitions). Widely used computer programs to estimate divergence times use independent and identically distributed (i.i.d.) priors on the substitution rates for different loci. The i.i.d. prior is problematic. As the number of loci (L) increases, the prior variance of the average rate across all loci goes to zero at the rate 1/L. As a consequence, the rate prior dominates posterior time estimates when many loci are analyzed, and if the rate prior is misspecified, the estimated divergence times will converge to wrong values with very narrow credibility intervals. Here we develop a new prior on the locus rates based on the Dirichlet distribution that corrects the problematic behavior of the i.i.d. prior. We use computer simulation and real data analysis to highlight the differences between the old and new priors. For a dataset for six primate species, we show that with the old i.i.d. prior, if the prior rate is too high (or too low), the estimated divergence times are too young (or too old), outside the bounds imposed by the fossil calibrations. In contrast, with the new Dirichlet prior, posterior time estimates are insensitive to the rate prior and are compatible with the fossil calibrations. We re-analyzed a phylogenomic data set of 36 mammal species and show that using many fossil calibrations can alleviate the adverse impact of a misspecified rate prior to some extent. We recommend the use of the new Dirichlet prior in Bayesian divergence time estimation. [Bayesian inference, divergence time, relaxed clock, rate prior, partition analysis.] |
format | Online Article Text |
id | pubmed-4055871 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-40558712014-06-13 The Impact of the Rate Prior on Bayesian Estimation of Divergence Times with Multiple Loci Dos Reis, Mario Zhu, Tianqi Yang, Ziheng Syst Biol Regular Articles Bayesian methods provide a powerful way to estimate species divergence times by combining information from molecular sequences with information from the fossil record. With the explosive increase of genomic data, divergence time estimation increasingly uses data of multiple loci (genes or site partitions). Widely used computer programs to estimate divergence times use independent and identically distributed (i.i.d.) priors on the substitution rates for different loci. The i.i.d. prior is problematic. As the number of loci (L) increases, the prior variance of the average rate across all loci goes to zero at the rate 1/L. As a consequence, the rate prior dominates posterior time estimates when many loci are analyzed, and if the rate prior is misspecified, the estimated divergence times will converge to wrong values with very narrow credibility intervals. Here we develop a new prior on the locus rates based on the Dirichlet distribution that corrects the problematic behavior of the i.i.d. prior. We use computer simulation and real data analysis to highlight the differences between the old and new priors. For a dataset for six primate species, we show that with the old i.i.d. prior, if the prior rate is too high (or too low), the estimated divergence times are too young (or too old), outside the bounds imposed by the fossil calibrations. In contrast, with the new Dirichlet prior, posterior time estimates are insensitive to the rate prior and are compatible with the fossil calibrations. We re-analyzed a phylogenomic data set of 36 mammal species and show that using many fossil calibrations can alleviate the adverse impact of a misspecified rate prior to some extent. We recommend the use of the new Dirichlet prior in Bayesian divergence time estimation. [Bayesian inference, divergence time, relaxed clock, rate prior, partition analysis.] Oxford University Press 2014-07 2014-03-21 /pmc/articles/PMC4055871/ /pubmed/24658316 http://dx.doi.org/10.1093/sysbio/syu020 Text en © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Regular Articles Dos Reis, Mario Zhu, Tianqi Yang, Ziheng The Impact of the Rate Prior on Bayesian Estimation of Divergence Times with Multiple Loci |
title | The Impact of the Rate Prior on Bayesian Estimation of Divergence Times with Multiple Loci |
title_full | The Impact of the Rate Prior on Bayesian Estimation of Divergence Times with Multiple Loci |
title_fullStr | The Impact of the Rate Prior on Bayesian Estimation of Divergence Times with Multiple Loci |
title_full_unstemmed | The Impact of the Rate Prior on Bayesian Estimation of Divergence Times with Multiple Loci |
title_short | The Impact of the Rate Prior on Bayesian Estimation of Divergence Times with Multiple Loci |
title_sort | impact of the rate prior on bayesian estimation of divergence times with multiple loci |
topic | Regular Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4055871/ https://www.ncbi.nlm.nih.gov/pubmed/24658316 http://dx.doi.org/10.1093/sysbio/syu020 |
work_keys_str_mv | AT dosreismario theimpactoftherateprioronbayesianestimationofdivergencetimeswithmultipleloci AT zhutianqi theimpactoftherateprioronbayesianestimationofdivergencetimeswithmultipleloci AT yangziheng theimpactoftherateprioronbayesianestimationofdivergencetimeswithmultipleloci AT dosreismario impactoftherateprioronbayesianestimationofdivergencetimeswithmultipleloci AT zhutianqi impactoftherateprioronbayesianestimationofdivergencetimeswithmultipleloci AT yangziheng impactoftherateprioronbayesianestimationofdivergencetimeswithmultipleloci |