Cargando…
TreeMerge: a new method for improving the scalability of species tree estimation methods
MOTIVATION: At RECOMB-CG 2018, we presented NJMerge and showed that it could be used within a divide-and-conquer framework to scale computationally intensive methods for species tree estimation to larger datasets. However, NJMerge has two significant limitations: it can fail to return a tree and, wh...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612878/ https://www.ncbi.nlm.nih.gov/pubmed/31510668 http://dx.doi.org/10.1093/bioinformatics/btz344 |
_version_ | 1783432957316300800 |
---|---|
author | Molloy, Erin K Warnow, Tandy |
author_facet | Molloy, Erin K Warnow, Tandy |
author_sort | Molloy, Erin K |
collection | PubMed |
description | MOTIVATION: At RECOMB-CG 2018, we presented NJMerge and showed that it could be used within a divide-and-conquer framework to scale computationally intensive methods for species tree estimation to larger datasets. However, NJMerge has two significant limitations: it can fail to return a tree and, when used within the proposed divide-and-conquer framework, has O(n(5)) running time for datasets with n species. RESULTS: Here we present a new method called ‘TreeMerge’ that improves on NJMerge in two ways: it is guaranteed to return a tree and it has dramatically faster running time within the same divide-and-conquer framework—only O(n(2)) time. We use a simulation study to evaluate TreeMerge in the context of multi-locus species tree estimation with two leading methods, ASTRAL-III and RAxML. We find that the divide-and-conquer framework using TreeMerge has a minor impact on species tree accuracy, dramatically reduces running time, and enables both ASTRAL-III and RAxML to complete on datasets (that they would otherwise fail on), when given 64 GB of memory and 48 h maximum running time. Thus, TreeMerge is a step toward a larger vision of enabling researchers with limited computational resources to perform large-scale species tree estimation, which we call Phylogenomics for All. AVAILABILITY AND IMPLEMENTATION: TreeMerge is publicly available on Github (http://github.com/ekmolloy/treemerge). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6612878 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-66128782019-07-12 TreeMerge: a new method for improving the scalability of species tree estimation methods Molloy, Erin K Warnow, Tandy Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: At RECOMB-CG 2018, we presented NJMerge and showed that it could be used within a divide-and-conquer framework to scale computationally intensive methods for species tree estimation to larger datasets. However, NJMerge has two significant limitations: it can fail to return a tree and, when used within the proposed divide-and-conquer framework, has O(n(5)) running time for datasets with n species. RESULTS: Here we present a new method called ‘TreeMerge’ that improves on NJMerge in two ways: it is guaranteed to return a tree and it has dramatically faster running time within the same divide-and-conquer framework—only O(n(2)) time. We use a simulation study to evaluate TreeMerge in the context of multi-locus species tree estimation with two leading methods, ASTRAL-III and RAxML. We find that the divide-and-conquer framework using TreeMerge has a minor impact on species tree accuracy, dramatically reduces running time, and enables both ASTRAL-III and RAxML to complete on datasets (that they would otherwise fail on), when given 64 GB of memory and 48 h maximum running time. Thus, TreeMerge is a step toward a larger vision of enabling researchers with limited computational resources to perform large-scale species tree estimation, which we call Phylogenomics for All. AVAILABILITY AND IMPLEMENTATION: TreeMerge is publicly available on Github (http://github.com/ekmolloy/treemerge). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612878/ /pubmed/31510668 http://dx.doi.org/10.1093/bioinformatics/btz344 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2019 Conference Proceedings Molloy, Erin K Warnow, Tandy TreeMerge: a new method for improving the scalability of species tree estimation methods |
title | TreeMerge: a new method for improving the scalability of species tree estimation methods |
title_full | TreeMerge: a new method for improving the scalability of species tree estimation methods |
title_fullStr | TreeMerge: a new method for improving the scalability of species tree estimation methods |
title_full_unstemmed | TreeMerge: a new method for improving the scalability of species tree estimation methods |
title_short | TreeMerge: a new method for improving the scalability of species tree estimation methods |
title_sort | treemerge: a new method for improving the scalability of species tree estimation methods |
topic | Ismb/Eccb 2019 Conference Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612878/ https://www.ncbi.nlm.nih.gov/pubmed/31510668 http://dx.doi.org/10.1093/bioinformatics/btz344 |
work_keys_str_mv | AT molloyerink treemergeanewmethodforimprovingthescalabilityofspeciestreeestimationmethods AT warnowtandy treemergeanewmethodforimprovingthescalabilityofspeciestreeestimationmethods |