Cargando…
StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates
Fully Bayesian multispecies coalescent (MSC) methods like *BEAST estimate species trees from multiple sequence alignments. Today thousands of genes can be sequenced for a given study, but using that many genes with *BEAST is intractably slow. An alternative is to use heuristic methods which compromi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850801/ https://www.ncbi.nlm.nih.gov/pubmed/28431121 http://dx.doi.org/10.1093/molbev/msx126 |
_version_ | 1783306286838841344 |
---|---|
author | Ogilvie, Huw A. Bouckaert, Remco R. Drummond, Alexei J. |
author_facet | Ogilvie, Huw A. Bouckaert, Remco R. Drummond, Alexei J. |
author_sort | Ogilvie, Huw A. |
collection | PubMed |
description | Fully Bayesian multispecies coalescent (MSC) methods like *BEAST estimate species trees from multiple sequence alignments. Today thousands of genes can be sequenced for a given study, but using that many genes with *BEAST is intractably slow. An alternative is to use heuristic methods which compromise accuracy or completeness in return for speed. A common heuristic is concatenation, which assumes that the evolutionary history of each gene tree is identical to the species tree. This is an inconsistent estimator of species tree topology, a worse estimator of divergence times, and induces spurious substitution rate variation when incomplete lineage sorting is present. Another class of heuristics directly motivated by the MSC avoids many of the pitfalls of concatenation but cannot be used to estimate divergence times. To enable fuller use of available data and more accurate inference of species tree topologies, divergence times, and substitution rates, we have developed a new version of *BEAST called StarBEAST2. To improve convergence rates we add analytical integration of population sizes, novel MCMC operators and other optimizations. Computational performance improved by 13.5× and 13.8× respectively when analyzing two empirical data sets, and an average of 33.1× across 30 simulated data sets. To enable accurate estimates of per-species substitution rates, we introduce species tree relaxed clocks, and show that StarBEAST2 is a more powerful and robust estimator of rate variation than concatenation. StarBEAST2 is available through the BEAUTi package manager in BEAST 2.4 and above. |
format | Online Article Text |
id | pubmed-5850801 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58508012018-03-23 StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates Ogilvie, Huw A. Bouckaert, Remco R. Drummond, Alexei J. Mol Biol Evol Resources Fully Bayesian multispecies coalescent (MSC) methods like *BEAST estimate species trees from multiple sequence alignments. Today thousands of genes can be sequenced for a given study, but using that many genes with *BEAST is intractably slow. An alternative is to use heuristic methods which compromise accuracy or completeness in return for speed. A common heuristic is concatenation, which assumes that the evolutionary history of each gene tree is identical to the species tree. This is an inconsistent estimator of species tree topology, a worse estimator of divergence times, and induces spurious substitution rate variation when incomplete lineage sorting is present. Another class of heuristics directly motivated by the MSC avoids many of the pitfalls of concatenation but cannot be used to estimate divergence times. To enable fuller use of available data and more accurate inference of species tree topologies, divergence times, and substitution rates, we have developed a new version of *BEAST called StarBEAST2. To improve convergence rates we add analytical integration of population sizes, novel MCMC operators and other optimizations. Computational performance improved by 13.5× and 13.8× respectively when analyzing two empirical data sets, and an average of 33.1× across 30 simulated data sets. To enable accurate estimates of per-species substitution rates, we introduce species tree relaxed clocks, and show that StarBEAST2 is a more powerful and robust estimator of rate variation than concatenation. StarBEAST2 is available through the BEAUTi package manager in BEAST 2.4 and above. Oxford University Press 2017-08 2017-04-14 /pmc/articles/PMC5850801/ /pubmed/28431121 http://dx.doi.org/10.1093/molbev/msx126 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Resources Ogilvie, Huw A. Bouckaert, Remco R. Drummond, Alexei J. StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates |
title | StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates |
title_full | StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates |
title_fullStr | StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates |
title_full_unstemmed | StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates |
title_short | StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates |
title_sort | starbeast2 brings faster species tree inference and accurate estimates of substitution rates |
topic | Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850801/ https://www.ncbi.nlm.nih.gov/pubmed/28431121 http://dx.doi.org/10.1093/molbev/msx126 |
work_keys_str_mv | AT ogilviehuwa starbeast2bringsfasterspeciestreeinferenceandaccurateestimatesofsubstitutionrates AT bouckaertremcor starbeast2bringsfasterspeciestreeinferenceandaccurateestimatesofsubstitutionrates AT drummondalexeij starbeast2bringsfasterspeciestreeinferenceandaccurateestimatesofsubstitutionrates |