Cargando…

Hierarchical clustering of maximum parsimony reconciliations

BACKGROUND: Maximum parsimony reconciliation in the duplication-transfer-loss model is a widely-used method for analyzing the evolutionary histories of pairs of entities such as hosts and parasites, symbiont species, and species and genes. While efficient algorithms are known for finding maximum par...

Descripción completa

Detalles Bibliográficos
Autores principales: Mawhorter, Ross, Libeskind-Hadas, Ran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882150/
https://www.ncbi.nlm.nih.gov/pubmed/31775628
http://dx.doi.org/10.1186/s12859-019-3223-5
_version_ 1783474092101337088
author Mawhorter, Ross
Libeskind-Hadas, Ran
author_facet Mawhorter, Ross
Libeskind-Hadas, Ran
author_sort Mawhorter, Ross
collection PubMed
description BACKGROUND: Maximum parsimony reconciliation in the duplication-transfer-loss model is a widely-used method for analyzing the evolutionary histories of pairs of entities such as hosts and parasites, symbiont species, and species and genes. While efficient algorithms are known for finding maximum parsimony reconciliations, the number of such reconciliations can be exponential in the size of the trees. Since these reconciliations can differ substantially from one another, making inferences from any one reconciliation may lead to conclusions that are not supported, or may even be contradicted, by other maximum parsimony reconciliations. Therefore, there is a need to find small sets of best representative reconciliations when the space of solutions is large and diverse. RESULTS: We provide a general framework for hierarchical clustering the space of maximum parsimony reconciliations. We demonstrate this framework for two specific linkage criteria, one that seeks to maximize the average support of the events found in the reconciliations in each cluster and the other that seeks to minimize the distance between reconciliations in each cluster. We analyze the asymptotic worst-case running times and provide experimental results that demonstrate the viability and utility of this approach. CONCLUSIONS: The hierarchical clustering algorithm method proposed here provides a new approach to find a set of representative reconciliations in the potentially vast and diverse space of maximum parsimony reconciliations.
format Online
Article
Text
id pubmed-6882150
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68821502019-12-03 Hierarchical clustering of maximum parsimony reconciliations Mawhorter, Ross Libeskind-Hadas, Ran BMC Bioinformatics Methodology Article BACKGROUND: Maximum parsimony reconciliation in the duplication-transfer-loss model is a widely-used method for analyzing the evolutionary histories of pairs of entities such as hosts and parasites, symbiont species, and species and genes. While efficient algorithms are known for finding maximum parsimony reconciliations, the number of such reconciliations can be exponential in the size of the trees. Since these reconciliations can differ substantially from one another, making inferences from any one reconciliation may lead to conclusions that are not supported, or may even be contradicted, by other maximum parsimony reconciliations. Therefore, there is a need to find small sets of best representative reconciliations when the space of solutions is large and diverse. RESULTS: We provide a general framework for hierarchical clustering the space of maximum parsimony reconciliations. We demonstrate this framework for two specific linkage criteria, one that seeks to maximize the average support of the events found in the reconciliations in each cluster and the other that seeks to minimize the distance between reconciliations in each cluster. We analyze the asymptotic worst-case running times and provide experimental results that demonstrate the viability and utility of this approach. CONCLUSIONS: The hierarchical clustering algorithm method proposed here provides a new approach to find a set of representative reconciliations in the potentially vast and diverse space of maximum parsimony reconciliations. BioMed Central 2019-11-27 /pmc/articles/PMC6882150/ /pubmed/31775628 http://dx.doi.org/10.1186/s12859-019-3223-5 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Mawhorter, Ross
Libeskind-Hadas, Ran
Hierarchical clustering of maximum parsimony reconciliations
title Hierarchical clustering of maximum parsimony reconciliations
title_full Hierarchical clustering of maximum parsimony reconciliations
title_fullStr Hierarchical clustering of maximum parsimony reconciliations
title_full_unstemmed Hierarchical clustering of maximum parsimony reconciliations
title_short Hierarchical clustering of maximum parsimony reconciliations
title_sort hierarchical clustering of maximum parsimony reconciliations
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882150/
https://www.ncbi.nlm.nih.gov/pubmed/31775628
http://dx.doi.org/10.1186/s12859-019-3223-5
work_keys_str_mv AT mawhorterross hierarchicalclusteringofmaximumparsimonyreconciliations
AT libeskindhadasran hierarchicalclusteringofmaximumparsimonyreconciliations