Cargando…

Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm

Supertree methods merge a set of overlapping phylogenetic trees into a supertree containing all taxa of the input trees. The challenge in supertree reconstruction is the way of dealing with conflicting information in the input trees. Many different algorithms for different objective functions have b...

Descripción completa

Detalles Bibliográficos
Autores principales: Fleischauer, Markus, Böcker, Sebastian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850620/
https://www.ncbi.nlm.nih.gov/pubmed/28873954
http://dx.doi.org/10.1093/molbev/msx191
_version_ 1783306255411970048
author Fleischauer, Markus
Böcker, Sebastian
author_facet Fleischauer, Markus
Böcker, Sebastian
author_sort Fleischauer, Markus
collection PubMed
description Supertree methods merge a set of overlapping phylogenetic trees into a supertree containing all taxa of the input trees. The challenge in supertree reconstruction is the way of dealing with conflicting information in the input trees. Many different algorithms for different objective functions have been suggested to resolve these conflicts. In particular, there exist methods based on encoding the source trees in a matrix, where the supertree is constructed applying a local search heuristic to optimize the respective objective function. We present a novel heuristic supertree algorithm called Bad Clade Deletion (BCD) supertrees. It uses minimum cuts to delete a locally minimal number of columns from such a matrix representation so that it is compatible. This is the complement problem to Matrix Representation with Compatibility (Maximum Split Fit). Our algorithm has guaranteed polynomial worst-case running time and performs swiftly in practice. Different from local search heuristics, it guarantees to return the directed perfect phylogeny for the input matrix, corresponding to the parent tree of the input trees, if one exists. Comparing supertrees to model trees for simulated data, BCD shows a better accuracy (F(1) score) than the state-of-the-art algorithms SuperFine (up to 3%) and Matrix Representation with Parsimony (up to 7%); at the same time, BCD is up to 7 times faster than SuperFine, and up to 600 times faster than Matrix Representation with Parsimony. Finally, using the BCD supertree as a starting tree for a combined Maximum Likelihood analysis using RAxML, we reach significantly improved accuracy (1% higher F(1) score) and running time (1.7-fold speedup).
format Online
Article
Text
id pubmed-5850620
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58506202018-03-23 Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm Fleischauer, Markus Böcker, Sebastian Mol Biol Evol Methods Supertree methods merge a set of overlapping phylogenetic trees into a supertree containing all taxa of the input trees. The challenge in supertree reconstruction is the way of dealing with conflicting information in the input trees. Many different algorithms for different objective functions have been suggested to resolve these conflicts. In particular, there exist methods based on encoding the source trees in a matrix, where the supertree is constructed applying a local search heuristic to optimize the respective objective function. We present a novel heuristic supertree algorithm called Bad Clade Deletion (BCD) supertrees. It uses minimum cuts to delete a locally minimal number of columns from such a matrix representation so that it is compatible. This is the complement problem to Matrix Representation with Compatibility (Maximum Split Fit). Our algorithm has guaranteed polynomial worst-case running time and performs swiftly in practice. Different from local search heuristics, it guarantees to return the directed perfect phylogeny for the input matrix, corresponding to the parent tree of the input trees, if one exists. Comparing supertrees to model trees for simulated data, BCD shows a better accuracy (F(1) score) than the state-of-the-art algorithms SuperFine (up to 3%) and Matrix Representation with Parsimony (up to 7%); at the same time, BCD is up to 7 times faster than SuperFine, and up to 600 times faster than Matrix Representation with Parsimony. Finally, using the BCD supertree as a starting tree for a combined Maximum Likelihood analysis using RAxML, we reach significantly improved accuracy (1% higher F(1) score) and running time (1.7-fold speedup). Oxford University Press 2017-09 2017-07-05 /pmc/articles/PMC5850620/ /pubmed/28873954 http://dx.doi.org/10.1093/molbev/msx191 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods
Fleischauer, Markus
Böcker, Sebastian
Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm
title Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm
title_full Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm
title_fullStr Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm
title_full_unstemmed Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm
title_short Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm
title_sort bad clade deletion supertrees: a fast and accurate supertree algorithm
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850620/
https://www.ncbi.nlm.nih.gov/pubmed/28873954
http://dx.doi.org/10.1093/molbev/msx191
work_keys_str_mv AT fleischauermarkus badcladedeletionsupertreesafastandaccuratesupertreealgorithm
AT bockersebastian badcladedeletionsupertreesafastandaccuratesupertreealgorithm