Cargando…
ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes
Motivation: The estimation of species phylogenies requires multiple loci, since different loci can have different trees due to incomplete lineage sorting, modeled by the multi-species coalescent model. We recently developed a coalescent-based method, ASTRAL, which is statistically consistent under t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765870/ https://www.ncbi.nlm.nih.gov/pubmed/26072508 http://dx.doi.org/10.1093/bioinformatics/btv234 |
_version_ | 1782417586692030464 |
---|---|
author | Mirarab, Siavash Warnow, Tandy |
author_facet | Mirarab, Siavash Warnow, Tandy |
author_sort | Mirarab, Siavash |
collection | PubMed |
description | Motivation: The estimation of species phylogenies requires multiple loci, since different loci can have different trees due to incomplete lineage sorting, modeled by the multi-species coalescent model. We recently developed a coalescent-based method, ASTRAL, which is statistically consistent under the multi-species coalescent model and which is more accurate than other coalescent-based methods on the datasets we examined. ASTRAL runs in polynomial time, by constraining the search space using a set of allowed ‘bipartitions’. Despite the limitation to allowed bipartitions, ASTRAL is statistically consistent. Results: We present a new version of ASTRAL, which we call ASTRAL-II. We show that ASTRAL-II has substantial advantages over ASTRAL: it is faster, can analyze much larger datasets (up to 1000 species and 1000 genes) and has substantially better accuracy under some conditions. ASTRAL’s running time is [Formula: see text] , and ASTRAL-II’s running time is [Formula: see text] , where n is the number of species, k is the number of loci and X is the set of allowed bipartitions for the search space. Availability and implementation: ASTRAL-II is available in open source at https://github.com/smirarab/ASTRAL and datasets used are available at http://www.cs.utexas.edu/~phylo/datasets/astral2/. Contact: smirarab@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4765870 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-47658702016-03-04 ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes Mirarab, Siavash Warnow, Tandy Bioinformatics Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland Motivation: The estimation of species phylogenies requires multiple loci, since different loci can have different trees due to incomplete lineage sorting, modeled by the multi-species coalescent model. We recently developed a coalescent-based method, ASTRAL, which is statistically consistent under the multi-species coalescent model and which is more accurate than other coalescent-based methods on the datasets we examined. ASTRAL runs in polynomial time, by constraining the search space using a set of allowed ‘bipartitions’. Despite the limitation to allowed bipartitions, ASTRAL is statistically consistent. Results: We present a new version of ASTRAL, which we call ASTRAL-II. We show that ASTRAL-II has substantial advantages over ASTRAL: it is faster, can analyze much larger datasets (up to 1000 species and 1000 genes) and has substantially better accuracy under some conditions. ASTRAL’s running time is [Formula: see text] , and ASTRAL-II’s running time is [Formula: see text] , where n is the number of species, k is the number of loci and X is the set of allowed bipartitions for the search space. Availability and implementation: ASTRAL-II is available in open source at https://github.com/smirarab/ASTRAL and datasets used are available at http://www.cs.utexas.edu/~phylo/datasets/astral2/. Contact: smirarab@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-06-15 2015-06-10 /pmc/articles/PMC4765870/ /pubmed/26072508 http://dx.doi.org/10.1093/bioinformatics/btv234 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland Mirarab, Siavash Warnow, Tandy ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes |
title | ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes |
title_full | ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes |
title_fullStr | ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes |
title_full_unstemmed | ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes |
title_short | ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes |
title_sort | astral-ii: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes |
topic | Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765870/ https://www.ncbi.nlm.nih.gov/pubmed/26072508 http://dx.doi.org/10.1093/bioinformatics/btv234 |
work_keys_str_mv | AT mirarabsiavash astraliicoalescentbasedspeciestreeestimationwithmanyhundredsoftaxaandthousandsofgenes AT warnowtandy astraliicoalescentbasedspeciestreeestimationwithmanyhundredsoftaxaandthousandsofgenes |