Cargando…

Clock model makes a large difference to age estimates of long-stemmed clades with no internal calibration: a test using Australian grasstrees

BACKGROUND: Estimating divergence times in phylogenies using a molecular clock depends on accurate modeling of nucleotide substitution rates in DNA sequences. Rate heterogeneity among lineages is likely to affect estimates, especially in lineages with long stems and short crowns (“broom” clades) and...

Descripción completa

Detalles Bibliográficos
Autores principales: Crisp, Michael D, Hardy, Nate B, Cook, Lyn G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4279595/
https://www.ncbi.nlm.nih.gov/pubmed/25523814
http://dx.doi.org/10.1186/s12862-014-0263-3
Descripción
Sumario:BACKGROUND: Estimating divergence times in phylogenies using a molecular clock depends on accurate modeling of nucleotide substitution rates in DNA sequences. Rate heterogeneity among lineages is likely to affect estimates, especially in lineages with long stems and short crowns (“broom” clades) and no internal calibration. We evaluate the performance of the random local clocks model (RLC) and the more routinely employed uncorrelated lognormal relaxed clock model (UCLN) in situations in which a significant rate shift occurs on the stem branch of a broom clade. We compare the results of simulations to empirical results from analyses of a real rate-heterogeneous taxon – Australian grass trees (Xanthorrhoea) – whose substitution rate is slower than in its sister groups, as determined by relative rate tests. RESULTS: In the simulated datasets, the RLC model performed much better than UCLN: RLC correctly estimated the age of the crown node of slow-rate broom clades, whereas UCLN estimates were consistently too young. Similarly, in the Xanthorrhoea dataset, UCLN returned significantly younger crown ages than RLC (mean estimates respectively 3–6 Ma versus 25–35 Ma). In both real and simulated datasets, Bayes Factor tests strongly favored the RLC model over the UCLN model. CONCLUSIONS: The choice of an unsuitable molecular clock model can strongly bias divergence time estimates. In particular, for data predicted to have more rate variation among than within clades, dating with RLC is much more likely to be accurate than with UCLN. The choice of clocks should be informed by the biology of the study group (e.g., life-form) or assessed with relative rate tests and post-hoc model comparisons. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-014-0263-3) contains supplementary material, which is available to authorized users.