Cargando…
ASTRAL: genome-scale coalescent-based species tree estimation
Motivation: Species trees provide insight into basic biology, including the mechanisms of evolution and how it modifies biomolecular function and structure, biodiversity and co-evolution between genes and species. Yet, gene trees often differ from species trees, creating challenges to species tree e...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147915/ https://www.ncbi.nlm.nih.gov/pubmed/25161245 http://dx.doi.org/10.1093/bioinformatics/btu462 |
_version_ | 1782332536831082496 |
---|---|
author | Mirarab, S. Reaz, R. Bayzid, Md. S. Zimmermann, T. Swenson, M. S. Warnow, T. |
author_facet | Mirarab, S. Reaz, R. Bayzid, Md. S. Zimmermann, T. Swenson, M. S. Warnow, T. |
author_sort | Mirarab, S. |
collection | PubMed |
description | Motivation: Species trees provide insight into basic biology, including the mechanisms of evolution and how it modifies biomolecular function and structure, biodiversity and co-evolution between genes and species. Yet, gene trees often differ from species trees, creating challenges to species tree estimation. One of the most frequent causes for conflicting topologies between gene trees and species trees is incomplete lineage sorting (ILS), which is modelled by the multi-species coalescent. While many methods have been developed to estimate species trees from multiple genes, some which have statistical guarantees under the multi-species coalescent model, existing methods are too computationally intensive for use with genome-scale analyses or have been shown to have poor accuracy under some realistic conditions. Results: We present ASTRAL, a fast method for estimating species trees from multiple genes. ASTRAL is statistically consistent, can run on datasets with thousands of genes and has outstanding accuracy—improving on MP-EST and the population tree from BUCKy, two statistically consistent leading coalescent-based methods. ASTRAL is often more accurate than concatenation using maximum likelihood, except when ILS levels are low or there are too few gene trees. Availability and implementation: ASTRAL is available in open source form at https://github.com/smirarab/ASTRAL/. Datasets studied in this article are available at http://www.cs.utexas.edu/users/phylo/datasets/astral. Contact: warnow@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4147915 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-41479152014-09-02 ASTRAL: genome-scale coalescent-based species tree estimation Mirarab, S. Reaz, R. Bayzid, Md. S. Zimmermann, T. Swenson, M. S. Warnow, T. Bioinformatics Eccb 2014 Proceedings Papers Committee Motivation: Species trees provide insight into basic biology, including the mechanisms of evolution and how it modifies biomolecular function and structure, biodiversity and co-evolution between genes and species. Yet, gene trees often differ from species trees, creating challenges to species tree estimation. One of the most frequent causes for conflicting topologies between gene trees and species trees is incomplete lineage sorting (ILS), which is modelled by the multi-species coalescent. While many methods have been developed to estimate species trees from multiple genes, some which have statistical guarantees under the multi-species coalescent model, existing methods are too computationally intensive for use with genome-scale analyses or have been shown to have poor accuracy under some realistic conditions. Results: We present ASTRAL, a fast method for estimating species trees from multiple genes. ASTRAL is statistically consistent, can run on datasets with thousands of genes and has outstanding accuracy—improving on MP-EST and the population tree from BUCKy, two statistically consistent leading coalescent-based methods. ASTRAL is often more accurate than concatenation using maximum likelihood, except when ILS levels are low or there are too few gene trees. Availability and implementation: ASTRAL is available in open source form at https://github.com/smirarab/ASTRAL/. Datasets studied in this article are available at http://www.cs.utexas.edu/users/phylo/datasets/astral. Contact: warnow@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-09-01 2014-08-22 /pmc/articles/PMC4147915/ /pubmed/25161245 http://dx.doi.org/10.1093/bioinformatics/btu462 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Eccb 2014 Proceedings Papers Committee Mirarab, S. Reaz, R. Bayzid, Md. S. Zimmermann, T. Swenson, M. S. Warnow, T. ASTRAL: genome-scale coalescent-based species tree estimation |
title | ASTRAL: genome-scale coalescent-based species tree estimation |
title_full | ASTRAL: genome-scale coalescent-based species tree estimation |
title_fullStr | ASTRAL: genome-scale coalescent-based species tree estimation |
title_full_unstemmed | ASTRAL: genome-scale coalescent-based species tree estimation |
title_short | ASTRAL: genome-scale coalescent-based species tree estimation |
title_sort | astral: genome-scale coalescent-based species tree estimation |
topic | Eccb 2014 Proceedings Papers Committee |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147915/ https://www.ncbi.nlm.nih.gov/pubmed/25161245 http://dx.doi.org/10.1093/bioinformatics/btu462 |
work_keys_str_mv | AT mirarabs astralgenomescalecoalescentbasedspeciestreeestimation AT reazr astralgenomescalecoalescentbasedspeciestreeestimation AT bayzidmds astralgenomescalecoalescentbasedspeciestreeestimation AT zimmermannt astralgenomescalecoalescentbasedspeciestreeestimation AT swensonms astralgenomescalecoalescentbasedspeciestreeestimation AT warnowt astralgenomescalecoalescentbasedspeciestreeestimation |