Cargando…
A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees
BACKGROUND: Reticulate events play an important role in determining evolutionary relationships. The problem of computing the minimum number of such events to explain discordance between two phylogenetic trees is a hard computational problem. Even for binary trees, exact solvers struggle to solve ins...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4023542/ https://www.ncbi.nlm.nih.gov/pubmed/24884964 http://dx.doi.org/10.1186/1471-2105-15-127 |
_version_ | 1782316562189910016 |
---|---|
author | Iersel, Leo van Kelk, Steven Lekić, Nela Scornavacca, Celine |
author_facet | Iersel, Leo van Kelk, Steven Lekić, Nela Scornavacca, Celine |
author_sort | Iersel, Leo van |
collection | PubMed |
description | BACKGROUND: Reticulate events play an important role in determining evolutionary relationships. The problem of computing the minimum number of such events to explain discordance between two phylogenetic trees is a hard computational problem. Even for binary trees, exact solvers struggle to solve instances with reticulation number larger than 40-50. RESULTS: Here we present CycleKiller and NonbinaryCycleKiller, the first methods to produce solutions verifiably close to optimality for instances with hundreds or even thousands of reticulations. CONCLUSIONS: Using simulations, we demonstrate that these algorithms run quickly for large and difficult instances, producing solutions that are very close to optimality. As a spin-off from our simulations we also present TerminusEst, which is the fastest exact method currently available that can handle nonbinary trees: this is used to measure the accuracy of the NonbinaryCycleKiller algorithm. All three methods are based on extensions of previous theoretical work (SIDMA 26(4):1635-1656, TCBB 10(1):18-25, SIDMA 28(1):49-66) and are publicly available. We also apply our methods to real data. |
format | Online Article Text |
id | pubmed-4023542 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40235422014-05-28 A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees Iersel, Leo van Kelk, Steven Lekić, Nela Scornavacca, Celine BMC Bioinformatics Research Article BACKGROUND: Reticulate events play an important role in determining evolutionary relationships. The problem of computing the minimum number of such events to explain discordance between two phylogenetic trees is a hard computational problem. Even for binary trees, exact solvers struggle to solve instances with reticulation number larger than 40-50. RESULTS: Here we present CycleKiller and NonbinaryCycleKiller, the first methods to produce solutions verifiably close to optimality for instances with hundreds or even thousands of reticulations. CONCLUSIONS: Using simulations, we demonstrate that these algorithms run quickly for large and difficult instances, producing solutions that are very close to optimality. As a spin-off from our simulations we also present TerminusEst, which is the fastest exact method currently available that can handle nonbinary trees: this is used to measure the accuracy of the NonbinaryCycleKiller algorithm. All three methods are based on extensions of previous theoretical work (SIDMA 26(4):1635-1656, TCBB 10(1):18-25, SIDMA 28(1):49-66) and are publicly available. We also apply our methods to real data. BioMed Central 2014-05-05 /pmc/articles/PMC4023542/ /pubmed/24884964 http://dx.doi.org/10.1186/1471-2105-15-127 Text en Copyright © 2014 van Iersel et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Iersel, Leo van Kelk, Steven Lekić, Nela Scornavacca, Celine A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees |
title | A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees |
title_full | A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees |
title_fullStr | A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees |
title_full_unstemmed | A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees |
title_short | A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees |
title_sort | practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4023542/ https://www.ncbi.nlm.nih.gov/pubmed/24884964 http://dx.doi.org/10.1186/1471-2105-15-127 |
work_keys_str_mv | AT ierselleovan apracticalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees AT kelksteven apracticalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees AT lekicnela apracticalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees AT scornavaccaceline apracticalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees AT ierselleovan practicalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees AT kelksteven practicalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees AT lekicnela practicalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees AT scornavaccaceline practicalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees |