Cargando…
BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees
Supertree methods enable the reconstruction of large phylogenies. The supertree problem can be formalized in different ways in order to cope with contradictory information in the input. Some supertree methods are based on encoding the input trees in a matrix; other methods try to find minimum cuts i...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995099/ https://www.ncbi.nlm.nih.gov/pubmed/29900080 http://dx.doi.org/10.7717/peerj.4987 |
_version_ | 1783330557873094656 |
---|---|
author | Fleischauer, Markus Böcker, Sebastian |
author_facet | Fleischauer, Markus Böcker, Sebastian |
author_sort | Fleischauer, Markus |
collection | PubMed |
description | Supertree methods enable the reconstruction of large phylogenies. The supertree problem can be formalized in different ways in order to cope with contradictory information in the input. Some supertree methods are based on encoding the input trees in a matrix; other methods try to find minimum cuts in some graph. Recently, we introduced Bad Clade Deletion (BCD) supertrees which combines the graph-based computation of minimum cuts with optimizing a global objective function on the matrix representation of the input trees. The BCD supertree method has guaranteed polynomial running time and is very swift in practice. The quality of reconstructed supertrees was superior to matrix representation with parsimony (MRP) and usually on par with SuperFine for simulated data; but particularly for biological data, quality of BCD supertrees could not keep up with SuperFine supertrees. Here, we present a beam search extension for the BCD algorithm that keeps alive a constant number of partial solutions in each top-down iteration phase. The guaranteed worst-case running time of the new algorithm is still polynomial in the size of the input. We present an exact and a randomized subroutine to generate suboptimal partial solutions. Both beam search approaches consistently improve supertree quality on all evaluated datasets when keeping 25 suboptimal solutions alive. Supertree quality of the BCD Beam Search algorithm is on par with MRP and SuperFine even for biological data. This is the best performance of a polynomial-time supertree algorithm reported so far. |
format | Online Article Text |
id | pubmed-5995099 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-59950992018-06-13 BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees Fleischauer, Markus Böcker, Sebastian PeerJ Bioinformatics Supertree methods enable the reconstruction of large phylogenies. The supertree problem can be formalized in different ways in order to cope with contradictory information in the input. Some supertree methods are based on encoding the input trees in a matrix; other methods try to find minimum cuts in some graph. Recently, we introduced Bad Clade Deletion (BCD) supertrees which combines the graph-based computation of minimum cuts with optimizing a global objective function on the matrix representation of the input trees. The BCD supertree method has guaranteed polynomial running time and is very swift in practice. The quality of reconstructed supertrees was superior to matrix representation with parsimony (MRP) and usually on par with SuperFine for simulated data; but particularly for biological data, quality of BCD supertrees could not keep up with SuperFine supertrees. Here, we present a beam search extension for the BCD algorithm that keeps alive a constant number of partial solutions in each top-down iteration phase. The guaranteed worst-case running time of the new algorithm is still polynomial in the size of the input. We present an exact and a randomized subroutine to generate suboptimal partial solutions. Both beam search approaches consistently improve supertree quality on all evaluated datasets when keeping 25 suboptimal solutions alive. Supertree quality of the BCD Beam Search algorithm is on par with MRP and SuperFine even for biological data. This is the best performance of a polynomial-time supertree algorithm reported so far. PeerJ Inc. 2018-06-08 /pmc/articles/PMC5995099/ /pubmed/29900080 http://dx.doi.org/10.7717/peerj.4987 Text en © 2018 Fleischauer and Böcker http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Fleischauer, Markus Böcker, Sebastian BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees |
title | BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees |
title_full | BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees |
title_fullStr | BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees |
title_full_unstemmed | BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees |
title_short | BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees |
title_sort | bcd beam search: considering suboptimal partial solutions in bad clade deletion supertrees |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995099/ https://www.ncbi.nlm.nih.gov/pubmed/29900080 http://dx.doi.org/10.7717/peerj.4987 |
work_keys_str_mv | AT fleischauermarkus bcdbeamsearchconsideringsuboptimalpartialsolutionsinbadcladedeletionsupertrees AT bockersebastian bcdbeamsearchconsideringsuboptimalpartialsolutionsinbadcladedeletionsupertrees |