Cargando…

Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation

One of the Grand Challenges in Science is the construction of the Tree of Life, an evolutionary tree containing several million species, spanning all life on earth. However, the construction of the Tree of Life is enormously computationally challenging, as all the current most accurate methods are e...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Xilin, Le, Thien, Christensen, Sarah A., Molloy, Erin K., Warnow, Tandy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8240396/
https://www.ncbi.nlm.nih.gov/pubmed/34183037
http://dx.doi.org/10.1186/s13015-021-00189-2
_version_ 1783715208461549568
author Yu, Xilin
Le, Thien
Christensen, Sarah A.
Molloy, Erin K.
Warnow, Tandy
author_facet Yu, Xilin
Le, Thien
Christensen, Sarah A.
Molloy, Erin K.
Warnow, Tandy
author_sort Yu, Xilin
collection PubMed
description One of the Grand Challenges in Science is the construction of the Tree of Life, an evolutionary tree containing several million species, spanning all life on earth. However, the construction of the Tree of Life is enormously computationally challenging, as all the current most accurate methods are either heuristics for NP-hard optimization problems or Bayesian MCMC methods that sample from tree space. One of the most promising approaches for improving scalability and accuracy for phylogeny estimation uses divide-and-conquer: a set of species is divided into overlapping subsets, trees are constructed on the subsets, and then merged together using a “supertree method”. Here, we present Exact-RFS-2, the first polynomial-time algorithm to find an optimal supertree of two trees, using the Robinson-Foulds Supertree (RFS) criterion (a major approach in supertree estimation that is related to maximum likelihood supertrees), and we prove that finding the RFS of three input trees is NP-hard. Exact-RFS-2 is available in open source form on Github at https://github.com/yuxilin51/GreedyRFS.
format Online
Article
Text
id pubmed-8240396
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-82403962021-06-30 Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation Yu, Xilin Le, Thien Christensen, Sarah A. Molloy, Erin K. Warnow, Tandy Algorithms Mol Biol Research One of the Grand Challenges in Science is the construction of the Tree of Life, an evolutionary tree containing several million species, spanning all life on earth. However, the construction of the Tree of Life is enormously computationally challenging, as all the current most accurate methods are either heuristics for NP-hard optimization problems or Bayesian MCMC methods that sample from tree space. One of the most promising approaches for improving scalability and accuracy for phylogeny estimation uses divide-and-conquer: a set of species is divided into overlapping subsets, trees are constructed on the subsets, and then merged together using a “supertree method”. Here, we present Exact-RFS-2, the first polynomial-time algorithm to find an optimal supertree of two trees, using the Robinson-Foulds Supertree (RFS) criterion (a major approach in supertree estimation that is related to maximum likelihood supertrees), and we prove that finding the RFS of three input trees is NP-hard. Exact-RFS-2 is available in open source form on Github at https://github.com/yuxilin51/GreedyRFS. BioMed Central 2021-06-28 /pmc/articles/PMC8240396/ /pubmed/34183037 http://dx.doi.org/10.1186/s13015-021-00189-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Yu, Xilin
Le, Thien
Christensen, Sarah A.
Molloy, Erin K.
Warnow, Tandy
Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation
title Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation
title_full Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation
title_fullStr Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation
title_full_unstemmed Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation
title_short Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation
title_sort using robinson-foulds supertrees in divide-and-conquer phylogeny estimation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8240396/
https://www.ncbi.nlm.nih.gov/pubmed/34183037
http://dx.doi.org/10.1186/s13015-021-00189-2
work_keys_str_mv AT yuxilin usingrobinsonfouldssupertreesindivideandconquerphylogenyestimation
AT lethien usingrobinsonfouldssupertreesindivideandconquerphylogenyestimation
AT christensensaraha usingrobinsonfouldssupertreesindivideandconquerphylogenyestimation
AT molloyerink usingrobinsonfouldssupertreesindivideandconquerphylogenyestimation
AT warnowtandy usingrobinsonfouldssupertreesindivideandconquerphylogenyestimation