Cargando…
Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm
Supertree methods merge a set of overlapping phylogenetic trees into a supertree containing all taxa of the input trees. The challenge in supertree reconstruction is the way of dealing with conflicting information in the input trees. Many different algorithms for different objective functions have b...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850620/ https://www.ncbi.nlm.nih.gov/pubmed/28873954 http://dx.doi.org/10.1093/molbev/msx191 |
_version_ | 1783306255411970048 |
---|---|
author | Fleischauer, Markus Böcker, Sebastian |
author_facet | Fleischauer, Markus Böcker, Sebastian |
author_sort | Fleischauer, Markus |
collection | PubMed |
description | Supertree methods merge a set of overlapping phylogenetic trees into a supertree containing all taxa of the input trees. The challenge in supertree reconstruction is the way of dealing with conflicting information in the input trees. Many different algorithms for different objective functions have been suggested to resolve these conflicts. In particular, there exist methods based on encoding the source trees in a matrix, where the supertree is constructed applying a local search heuristic to optimize the respective objective function. We present a novel heuristic supertree algorithm called Bad Clade Deletion (BCD) supertrees. It uses minimum cuts to delete a locally minimal number of columns from such a matrix representation so that it is compatible. This is the complement problem to Matrix Representation with Compatibility (Maximum Split Fit). Our algorithm has guaranteed polynomial worst-case running time and performs swiftly in practice. Different from local search heuristics, it guarantees to return the directed perfect phylogeny for the input matrix, corresponding to the parent tree of the input trees, if one exists. Comparing supertrees to model trees for simulated data, BCD shows a better accuracy (F(1) score) than the state-of-the-art algorithms SuperFine (up to 3%) and Matrix Representation with Parsimony (up to 7%); at the same time, BCD is up to 7 times faster than SuperFine, and up to 600 times faster than Matrix Representation with Parsimony. Finally, using the BCD supertree as a starting tree for a combined Maximum Likelihood analysis using RAxML, we reach significantly improved accuracy (1% higher F(1) score) and running time (1.7-fold speedup). |
format | Online Article Text |
id | pubmed-5850620 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58506202018-03-23 Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm Fleischauer, Markus Böcker, Sebastian Mol Biol Evol Methods Supertree methods merge a set of overlapping phylogenetic trees into a supertree containing all taxa of the input trees. The challenge in supertree reconstruction is the way of dealing with conflicting information in the input trees. Many different algorithms for different objective functions have been suggested to resolve these conflicts. In particular, there exist methods based on encoding the source trees in a matrix, where the supertree is constructed applying a local search heuristic to optimize the respective objective function. We present a novel heuristic supertree algorithm called Bad Clade Deletion (BCD) supertrees. It uses minimum cuts to delete a locally minimal number of columns from such a matrix representation so that it is compatible. This is the complement problem to Matrix Representation with Compatibility (Maximum Split Fit). Our algorithm has guaranteed polynomial worst-case running time and performs swiftly in practice. Different from local search heuristics, it guarantees to return the directed perfect phylogeny for the input matrix, corresponding to the parent tree of the input trees, if one exists. Comparing supertrees to model trees for simulated data, BCD shows a better accuracy (F(1) score) than the state-of-the-art algorithms SuperFine (up to 3%) and Matrix Representation with Parsimony (up to 7%); at the same time, BCD is up to 7 times faster than SuperFine, and up to 600 times faster than Matrix Representation with Parsimony. Finally, using the BCD supertree as a starting tree for a combined Maximum Likelihood analysis using RAxML, we reach significantly improved accuracy (1% higher F(1) score) and running time (1.7-fold speedup). Oxford University Press 2017-09 2017-07-05 /pmc/articles/PMC5850620/ /pubmed/28873954 http://dx.doi.org/10.1093/molbev/msx191 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Fleischauer, Markus Böcker, Sebastian Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm |
title | Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm |
title_full | Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm |
title_fullStr | Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm |
title_full_unstemmed | Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm |
title_short | Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm |
title_sort | bad clade deletion supertrees: a fast and accurate supertree algorithm |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850620/ https://www.ncbi.nlm.nih.gov/pubmed/28873954 http://dx.doi.org/10.1093/molbev/msx191 |
work_keys_str_mv | AT fleischauermarkus badcladedeletionsupertreesafastandaccuratesupertreealgorithm AT bockersebastian badcladedeletionsupertreesafastandaccuratesupertreealgorithm |