Cargando…

ASTRAL: genome-scale coalescent-based species tree estimation

Motivation: Species trees provide insight into basic biology, including the mechanisms of evolution and how it modifies biomolecular function and structure, biodiversity and co-evolution between genes and species. Yet, gene trees often differ from species trees, creating challenges to species tree e...

Descripción completa

Detalles Bibliográficos
Autores principales: Mirarab, S., Reaz, R., Bayzid, Md. S., Zimmermann, T., Swenson, M. S., Warnow, T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147915/
https://www.ncbi.nlm.nih.gov/pubmed/25161245
http://dx.doi.org/10.1093/bioinformatics/btu462
_version_ 1782332536831082496
author Mirarab, S.
Reaz, R.
Bayzid, Md. S.
Zimmermann, T.
Swenson, M. S.
Warnow, T.
author_facet Mirarab, S.
Reaz, R.
Bayzid, Md. S.
Zimmermann, T.
Swenson, M. S.
Warnow, T.
author_sort Mirarab, S.
collection PubMed
description Motivation: Species trees provide insight into basic biology, including the mechanisms of evolution and how it modifies biomolecular function and structure, biodiversity and co-evolution between genes and species. Yet, gene trees often differ from species trees, creating challenges to species tree estimation. One of the most frequent causes for conflicting topologies between gene trees and species trees is incomplete lineage sorting (ILS), which is modelled by the multi-species coalescent. While many methods have been developed to estimate species trees from multiple genes, some which have statistical guarantees under the multi-species coalescent model, existing methods are too computationally intensive for use with genome-scale analyses or have been shown to have poor accuracy under some realistic conditions. Results: We present ASTRAL, a fast method for estimating species trees from multiple genes. ASTRAL is statistically consistent, can run on datasets with thousands of genes and has outstanding accuracy—improving on MP-EST and the population tree from BUCKy, two statistically consistent leading coalescent-based methods. ASTRAL is often more accurate than concatenation using maximum likelihood, except when ILS levels are low or there are too few gene trees. Availability and implementation: ASTRAL is available in open source form at https://github.com/smirarab/ASTRAL/. Datasets studied in this article are available at http://www.cs.utexas.edu/users/phylo/datasets/astral. Contact: warnow@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4147915
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41479152014-09-02 ASTRAL: genome-scale coalescent-based species tree estimation Mirarab, S. Reaz, R. Bayzid, Md. S. Zimmermann, T. Swenson, M. S. Warnow, T. Bioinformatics Eccb 2014 Proceedings Papers Committee Motivation: Species trees provide insight into basic biology, including the mechanisms of evolution and how it modifies biomolecular function and structure, biodiversity and co-evolution between genes and species. Yet, gene trees often differ from species trees, creating challenges to species tree estimation. One of the most frequent causes for conflicting topologies between gene trees and species trees is incomplete lineage sorting (ILS), which is modelled by the multi-species coalescent. While many methods have been developed to estimate species trees from multiple genes, some which have statistical guarantees under the multi-species coalescent model, existing methods are too computationally intensive for use with genome-scale analyses or have been shown to have poor accuracy under some realistic conditions. Results: We present ASTRAL, a fast method for estimating species trees from multiple genes. ASTRAL is statistically consistent, can run on datasets with thousands of genes and has outstanding accuracy—improving on MP-EST and the population tree from BUCKy, two statistically consistent leading coalescent-based methods. ASTRAL is often more accurate than concatenation using maximum likelihood, except when ILS levels are low or there are too few gene trees. Availability and implementation: ASTRAL is available in open source form at https://github.com/smirarab/ASTRAL/. Datasets studied in this article are available at http://www.cs.utexas.edu/users/phylo/datasets/astral. Contact: warnow@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-09-01 2014-08-22 /pmc/articles/PMC4147915/ /pubmed/25161245 http://dx.doi.org/10.1093/bioinformatics/btu462 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Eccb 2014 Proceedings Papers Committee
Mirarab, S.
Reaz, R.
Bayzid, Md. S.
Zimmermann, T.
Swenson, M. S.
Warnow, T.
ASTRAL: genome-scale coalescent-based species tree estimation
title ASTRAL: genome-scale coalescent-based species tree estimation
title_full ASTRAL: genome-scale coalescent-based species tree estimation
title_fullStr ASTRAL: genome-scale coalescent-based species tree estimation
title_full_unstemmed ASTRAL: genome-scale coalescent-based species tree estimation
title_short ASTRAL: genome-scale coalescent-based species tree estimation
title_sort astral: genome-scale coalescent-based species tree estimation
topic Eccb 2014 Proceedings Papers Committee
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147915/
https://www.ncbi.nlm.nih.gov/pubmed/25161245
http://dx.doi.org/10.1093/bioinformatics/btu462
work_keys_str_mv AT mirarabs astralgenomescalecoalescentbasedspeciestreeestimation
AT reazr astralgenomescalecoalescentbasedspeciestreeestimation
AT bayzidmds astralgenomescalecoalescentbasedspeciestreeestimation
AT zimmermannt astralgenomescalecoalescentbasedspeciestreeestimation
AT swensonms astralgenomescalecoalescentbasedspeciestreeestimation
AT warnowt astralgenomescalecoalescentbasedspeciestreeestimation