Cargando…

Introducing TreeCollapse: a novel greedy algorithm to solve the cophylogeny reconstruction problem

BACKGROUND: Cophylogeny mapping is used to uncover deep coevolutionary associations between two or more phylogenetic histories at a macro coevolutionary scale. As cophylogeny mapping is NP-Hard, this technique relies heavily on heuristics to solve all but the most trivial cases. One notable approach...

Descripción completa

Detalles Bibliográficos
Autores principales: Drinkwater, Benjamin, Charleston, Michael A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4290644/
https://www.ncbi.nlm.nih.gov/pubmed/25521705
http://dx.doi.org/10.1186/1471-2105-15-S16-S14
_version_ 1782352279003725824
author Drinkwater, Benjamin
Charleston, Michael A
author_facet Drinkwater, Benjamin
Charleston, Michael A
author_sort Drinkwater, Benjamin
collection PubMed
description BACKGROUND: Cophylogeny mapping is used to uncover deep coevolutionary associations between two or more phylogenetic histories at a macro coevolutionary scale. As cophylogeny mapping is NP-Hard, this technique relies heavily on heuristics to solve all but the most trivial cases. One notable approach utilises a metaheuristic to search only a subset of the exponential number of fixed node orderings possible for the phylogenetic histories in question. This is of particular interest as it is the only known heuristic that guarantees biologically feasible solutions. This has enabled research to focus on larger coevolutionary systems, such as coevolutionary associations between figs and their pollinator wasps, including over 200 taxa. Although able to converge on solutions for problem instances of this size, a reduction from the current cubic running time is required to handle larger systems, such as Wolbachia and their insect hosts. RESULTS: Rather than solving this underlying problem optimally this work presents a greedy algorithm called TreeCollapse, which uses common topological patterns to recover an approximation of the coevolutionary history where the internal node ordering is fixed. This approach offers a significant speed-up compared to previous methods, running in linear time. This algorithm has been applied to over 100 well-known coevolutionary systems converging on Pareto optimal solutions in over 68% of test cases, even where in some cases the Pareto optimal solution has not previously been recoverable. Further, while TreeCollapse applies a local search technique, it can guarantee solutions are biologically feasible, making this the fastest method that can provide such a guarantee. CONCLUSION: As a result, we argue that the newly proposed algorithm is a valuable addition to the field of coevolutionary research. Not only does it offer a significantly faster method to estimate the cost of cophylogeny mappings but by using this approach, in conjunction with existing heuristics, it can assist in recovering a larger subset of the Pareto front than has previously been possible.
format Online
Article
Text
id pubmed-4290644
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42906442015-01-15 Introducing TreeCollapse: a novel greedy algorithm to solve the cophylogeny reconstruction problem Drinkwater, Benjamin Charleston, Michael A BMC Bioinformatics Research BACKGROUND: Cophylogeny mapping is used to uncover deep coevolutionary associations between two or more phylogenetic histories at a macro coevolutionary scale. As cophylogeny mapping is NP-Hard, this technique relies heavily on heuristics to solve all but the most trivial cases. One notable approach utilises a metaheuristic to search only a subset of the exponential number of fixed node orderings possible for the phylogenetic histories in question. This is of particular interest as it is the only known heuristic that guarantees biologically feasible solutions. This has enabled research to focus on larger coevolutionary systems, such as coevolutionary associations between figs and their pollinator wasps, including over 200 taxa. Although able to converge on solutions for problem instances of this size, a reduction from the current cubic running time is required to handle larger systems, such as Wolbachia and their insect hosts. RESULTS: Rather than solving this underlying problem optimally this work presents a greedy algorithm called TreeCollapse, which uses common topological patterns to recover an approximation of the coevolutionary history where the internal node ordering is fixed. This approach offers a significant speed-up compared to previous methods, running in linear time. This algorithm has been applied to over 100 well-known coevolutionary systems converging on Pareto optimal solutions in over 68% of test cases, even where in some cases the Pareto optimal solution has not previously been recoverable. Further, while TreeCollapse applies a local search technique, it can guarantee solutions are biologically feasible, making this the fastest method that can provide such a guarantee. CONCLUSION: As a result, we argue that the newly proposed algorithm is a valuable addition to the field of coevolutionary research. Not only does it offer a significantly faster method to estimate the cost of cophylogeny mappings but by using this approach, in conjunction with existing heuristics, it can assist in recovering a larger subset of the Pareto front than has previously been possible. BioMed Central 2014-12-08 /pmc/articles/PMC4290644/ /pubmed/25521705 http://dx.doi.org/10.1186/1471-2105-15-S16-S14 Text en Copyright © 2014 Drinkwater and Charleston; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Drinkwater, Benjamin
Charleston, Michael A
Introducing TreeCollapse: a novel greedy algorithm to solve the cophylogeny reconstruction problem
title Introducing TreeCollapse: a novel greedy algorithm to solve the cophylogeny reconstruction problem
title_full Introducing TreeCollapse: a novel greedy algorithm to solve the cophylogeny reconstruction problem
title_fullStr Introducing TreeCollapse: a novel greedy algorithm to solve the cophylogeny reconstruction problem
title_full_unstemmed Introducing TreeCollapse: a novel greedy algorithm to solve the cophylogeny reconstruction problem
title_short Introducing TreeCollapse: a novel greedy algorithm to solve the cophylogeny reconstruction problem
title_sort introducing treecollapse: a novel greedy algorithm to solve the cophylogeny reconstruction problem
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4290644/
https://www.ncbi.nlm.nih.gov/pubmed/25521705
http://dx.doi.org/10.1186/1471-2105-15-S16-S14
work_keys_str_mv AT drinkwaterbenjamin introducingtreecollapseanovelgreedyalgorithmtosolvethecophylogenyreconstructionproblem
AT charlestonmichaela introducingtreecollapseanovelgreedyalgorithmtosolvethecophylogenyreconstructionproblem