Cargando…

ASTRID: Accurate Species TRees from Internode Distances

BACKGROUND: Incomplete lineage sorting (ILS), modelled by the multi-species coalescent (MSC), is known to create discordance between gene trees and species trees, and lead to inaccurate species tree estimations unless appropriate methods are used to estimate the species tree. While many statisticall...

Descripción completa

Detalles Bibliográficos
Autores principales: Vachaspati, Pranjal, Warnow, Tandy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4602181/
https://www.ncbi.nlm.nih.gov/pubmed/26449326
http://dx.doi.org/10.1186/1471-2164-16-S10-S3
_version_ 1782394666960814080
author Vachaspati, Pranjal
Warnow, Tandy
author_facet Vachaspati, Pranjal
Warnow, Tandy
author_sort Vachaspati, Pranjal
collection PubMed
description BACKGROUND: Incomplete lineage sorting (ILS), modelled by the multi-species coalescent (MSC), is known to create discordance between gene trees and species trees, and lead to inaccurate species tree estimations unless appropriate methods are used to estimate the species tree. While many statistically consistent methods have been developed to estimate the species tree in the presence of ILS, only ASTRAL-2 and NJst have been shown to have good accuracy on large datasets. Yet, NJst is generally slower and less accurate than ASTRAL-2, and cannot run on some datasets. RESULTS: We have redesigned NJst to enable it to run on all datasets, and we have expanded its design space so that it can be used with different distance-based tree estimation methods. The resultant method, ASTRID, is statistically consistent under the MSC model, and has accuracy that is competitive with ASTRAL-2. Furthermore, ASTRID is much faster than ASTRAL-2, completing in minutes on some datasets for which ASTRAL-2 used hours. CONCLUSIONS: ASTRID is a new coalescent-based method for species tree estimation that is competitive with the best current method in terms of accuracy, while being much faster. ASTRID is available in open source form on github.
format Online
Article
Text
id pubmed-4602181
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46021812015-10-13 ASTRID: Accurate Species TRees from Internode Distances Vachaspati, Pranjal Warnow, Tandy BMC Genomics Research BACKGROUND: Incomplete lineage sorting (ILS), modelled by the multi-species coalescent (MSC), is known to create discordance between gene trees and species trees, and lead to inaccurate species tree estimations unless appropriate methods are used to estimate the species tree. While many statistically consistent methods have been developed to estimate the species tree in the presence of ILS, only ASTRAL-2 and NJst have been shown to have good accuracy on large datasets. Yet, NJst is generally slower and less accurate than ASTRAL-2, and cannot run on some datasets. RESULTS: We have redesigned NJst to enable it to run on all datasets, and we have expanded its design space so that it can be used with different distance-based tree estimation methods. The resultant method, ASTRID, is statistically consistent under the MSC model, and has accuracy that is competitive with ASTRAL-2. Furthermore, ASTRID is much faster than ASTRAL-2, completing in minutes on some datasets for which ASTRAL-2 used hours. CONCLUSIONS: ASTRID is a new coalescent-based method for species tree estimation that is competitive with the best current method in terms of accuracy, while being much faster. ASTRID is available in open source form on github. BioMed Central 2015-10-02 /pmc/articles/PMC4602181/ /pubmed/26449326 http://dx.doi.org/10.1186/1471-2164-16-S10-S3 Text en Copyright © 2015 Vachaspati and Warnow http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Vachaspati, Pranjal
Warnow, Tandy
ASTRID: Accurate Species TRees from Internode Distances
title ASTRID: Accurate Species TRees from Internode Distances
title_full ASTRID: Accurate Species TRees from Internode Distances
title_fullStr ASTRID: Accurate Species TRees from Internode Distances
title_full_unstemmed ASTRID: Accurate Species TRees from Internode Distances
title_short ASTRID: Accurate Species TRees from Internode Distances
title_sort astrid: accurate species trees from internode distances
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4602181/
https://www.ncbi.nlm.nih.gov/pubmed/26449326
http://dx.doi.org/10.1186/1471-2164-16-S10-S3
work_keys_str_mv AT vachaspatipranjal astridaccuratespeciestreesfrominternodedistances
AT warnowtandy astridaccuratespeciestreesfrominternodedistances