Cargando…

StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates

Fully Bayesian multispecies coalescent (MSC) methods like *BEAST estimate species trees from multiple sequence alignments. Today thousands of genes can be sequenced for a given study, but using that many genes with *BEAST is intractably slow. An alternative is to use heuristic methods which compromi...

Descripción completa

Detalles Bibliográficos
Autores principales: Ogilvie, Huw A., Bouckaert, Remco R., Drummond, Alexei J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850801/
https://www.ncbi.nlm.nih.gov/pubmed/28431121
http://dx.doi.org/10.1093/molbev/msx126
_version_ 1783306286838841344
author Ogilvie, Huw A.
Bouckaert, Remco R.
Drummond, Alexei J.
author_facet Ogilvie, Huw A.
Bouckaert, Remco R.
Drummond, Alexei J.
author_sort Ogilvie, Huw A.
collection PubMed
description Fully Bayesian multispecies coalescent (MSC) methods like *BEAST estimate species trees from multiple sequence alignments. Today thousands of genes can be sequenced for a given study, but using that many genes with *BEAST is intractably slow. An alternative is to use heuristic methods which compromise accuracy or completeness in return for speed. A common heuristic is concatenation, which assumes that the evolutionary history of each gene tree is identical to the species tree. This is an inconsistent estimator of species tree topology, a worse estimator of divergence times, and induces spurious substitution rate variation when incomplete lineage sorting is present. Another class of heuristics directly motivated by the MSC avoids many of the pitfalls of concatenation but cannot be used to estimate divergence times. To enable fuller use of available data and more accurate inference of species tree topologies, divergence times, and substitution rates, we have developed a new version of *BEAST called StarBEAST2. To improve convergence rates we add analytical integration of population sizes, novel MCMC operators and other optimizations. Computational performance improved by 13.5× and 13.8× respectively when analyzing two empirical data sets, and an average of 33.1× across 30 simulated data sets. To enable accurate estimates of per-species substitution rates, we introduce species tree relaxed clocks, and show that StarBEAST2 is a more powerful and robust estimator of rate variation than concatenation. StarBEAST2 is available through the BEAUTi package manager in BEAST 2.4 and above.
format Online
Article
Text
id pubmed-5850801
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58508012018-03-23 StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates Ogilvie, Huw A. Bouckaert, Remco R. Drummond, Alexei J. Mol Biol Evol Resources Fully Bayesian multispecies coalescent (MSC) methods like *BEAST estimate species trees from multiple sequence alignments. Today thousands of genes can be sequenced for a given study, but using that many genes with *BEAST is intractably slow. An alternative is to use heuristic methods which compromise accuracy or completeness in return for speed. A common heuristic is concatenation, which assumes that the evolutionary history of each gene tree is identical to the species tree. This is an inconsistent estimator of species tree topology, a worse estimator of divergence times, and induces spurious substitution rate variation when incomplete lineage sorting is present. Another class of heuristics directly motivated by the MSC avoids many of the pitfalls of concatenation but cannot be used to estimate divergence times. To enable fuller use of available data and more accurate inference of species tree topologies, divergence times, and substitution rates, we have developed a new version of *BEAST called StarBEAST2. To improve convergence rates we add analytical integration of population sizes, novel MCMC operators and other optimizations. Computational performance improved by 13.5× and 13.8× respectively when analyzing two empirical data sets, and an average of 33.1× across 30 simulated data sets. To enable accurate estimates of per-species substitution rates, we introduce species tree relaxed clocks, and show that StarBEAST2 is a more powerful and robust estimator of rate variation than concatenation. StarBEAST2 is available through the BEAUTi package manager in BEAST 2.4 and above. Oxford University Press 2017-08 2017-04-14 /pmc/articles/PMC5850801/ /pubmed/28431121 http://dx.doi.org/10.1093/molbev/msx126 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Resources
Ogilvie, Huw A.
Bouckaert, Remco R.
Drummond, Alexei J.
StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates
title StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates
title_full StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates
title_fullStr StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates
title_full_unstemmed StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates
title_short StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates
title_sort starbeast2 brings faster species tree inference and accurate estimates of substitution rates
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850801/
https://www.ncbi.nlm.nih.gov/pubmed/28431121
http://dx.doi.org/10.1093/molbev/msx126
work_keys_str_mv AT ogilviehuwa starbeast2bringsfasterspeciestreeinferenceandaccurateestimatesofsubstitutionrates
AT bouckaertremcor starbeast2bringsfasterspeciestreeinferenceandaccurateestimatesofsubstitutionrates
AT drummondalexeij starbeast2bringsfasterspeciestreeinferenceandaccurateestimatesofsubstitutionrates