Cargando…

A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees

BACKGROUND: Reticulate events play an important role in determining evolutionary relationships. The problem of computing the minimum number of such events to explain discordance between two phylogenetic trees is a hard computational problem. Even for binary trees, exact solvers struggle to solve ins...

Descripción completa

Detalles Bibliográficos
Autores principales: Iersel, Leo van, Kelk, Steven, Lekić, Nela, Scornavacca, Celine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4023542/
https://www.ncbi.nlm.nih.gov/pubmed/24884964
http://dx.doi.org/10.1186/1471-2105-15-127
_version_ 1782316562189910016
author Iersel, Leo van
Kelk, Steven
Lekić, Nela
Scornavacca, Celine
author_facet Iersel, Leo van
Kelk, Steven
Lekić, Nela
Scornavacca, Celine
author_sort Iersel, Leo van
collection PubMed
description BACKGROUND: Reticulate events play an important role in determining evolutionary relationships. The problem of computing the minimum number of such events to explain discordance between two phylogenetic trees is a hard computational problem. Even for binary trees, exact solvers struggle to solve instances with reticulation number larger than 40-50. RESULTS: Here we present CycleKiller and NonbinaryCycleKiller, the first methods to produce solutions verifiably close to optimality for instances with hundreds or even thousands of reticulations. CONCLUSIONS: Using simulations, we demonstrate that these algorithms run quickly for large and difficult instances, producing solutions that are very close to optimality. As a spin-off from our simulations we also present TerminusEst, which is the fastest exact method currently available that can handle nonbinary trees: this is used to measure the accuracy of the NonbinaryCycleKiller algorithm. All three methods are based on extensions of previous theoretical work (SIDMA 26(4):1635-1656, TCBB 10(1):18-25, SIDMA 28(1):49-66) and are publicly available. We also apply our methods to real data.
format Online
Article
Text
id pubmed-4023542
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40235422014-05-28 A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees Iersel, Leo van Kelk, Steven Lekić, Nela Scornavacca, Celine BMC Bioinformatics Research Article BACKGROUND: Reticulate events play an important role in determining evolutionary relationships. The problem of computing the minimum number of such events to explain discordance between two phylogenetic trees is a hard computational problem. Even for binary trees, exact solvers struggle to solve instances with reticulation number larger than 40-50. RESULTS: Here we present CycleKiller and NonbinaryCycleKiller, the first methods to produce solutions verifiably close to optimality for instances with hundreds or even thousands of reticulations. CONCLUSIONS: Using simulations, we demonstrate that these algorithms run quickly for large and difficult instances, producing solutions that are very close to optimality. As a spin-off from our simulations we also present TerminusEst, which is the fastest exact method currently available that can handle nonbinary trees: this is used to measure the accuracy of the NonbinaryCycleKiller algorithm. All three methods are based on extensions of previous theoretical work (SIDMA 26(4):1635-1656, TCBB 10(1):18-25, SIDMA 28(1):49-66) and are publicly available. We also apply our methods to real data. BioMed Central 2014-05-05 /pmc/articles/PMC4023542/ /pubmed/24884964 http://dx.doi.org/10.1186/1471-2105-15-127 Text en Copyright © 2014 van Iersel et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Iersel, Leo van
Kelk, Steven
Lekić, Nela
Scornavacca, Celine
A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees
title A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees
title_full A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees
title_fullStr A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees
title_full_unstemmed A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees
title_short A practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees
title_sort practical approximation algorithm for solving massive instances of hybridization number for binary and nonbinary trees
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4023542/
https://www.ncbi.nlm.nih.gov/pubmed/24884964
http://dx.doi.org/10.1186/1471-2105-15-127
work_keys_str_mv AT ierselleovan apracticalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees
AT kelksteven apracticalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees
AT lekicnela apracticalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees
AT scornavaccaceline apracticalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees
AT ierselleovan practicalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees
AT kelksteven practicalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees
AT lekicnela practicalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees
AT scornavaccaceline practicalapproximationalgorithmforsolvingmassiveinstancesofhybridizationnumberforbinaryandnonbinarytrees