Cargando…

SPR Distance Computation for Unrooted Trees

The subtree prune and regraft distance (d(SPR)) between phylogenetic trees is important both as a general means of comparing phylogenetic tree topologies as well as a measure of lateral gene transfer (LGT). Although there has been extensive study on the computation of d(SPR) and similar metrics betw...

Descripción completa

Detalles Bibliográficos
Autores principales: Hickey, Glenn, Dehne, Frank, Rau-Chaplin, Andrew, Blouin, Christian
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2614206/
https://www.ncbi.nlm.nih.gov/pubmed/19204804
_version_ 1782163223473029120
author Hickey, Glenn
Dehne, Frank
Rau-Chaplin, Andrew
Blouin, Christian
author_facet Hickey, Glenn
Dehne, Frank
Rau-Chaplin, Andrew
Blouin, Christian
author_sort Hickey, Glenn
collection PubMed
description The subtree prune and regraft distance (d(SPR)) between phylogenetic trees is important both as a general means of comparing phylogenetic tree topologies as well as a measure of lateral gene transfer (LGT). Although there has been extensive study on the computation of d(SPR) and similar metrics between rooted trees, much less is known about SPR distances for unrooted trees, which often arise in practice when the root is unresolved. We show that unrooted SPR distance computation is NP-Hard and verify which techniques from related work can and cannot be applied. We then present an efficient heuristic algorithm for this problem and benchmark it on a variety of synthetic datasets. Our algorithm computes the exact SPR distance between unrooted tree, and the heuristic element is only with respect to the algorithm’s computation time. Our method is a heuristic version of a fixed parameter tractability (FPT) approach and our experiments indicate that the running time behaves similar to FPT algorithms. For real data sets, our algorithm was able to quickly compute d(SPR) for the majority of trees that were part of a study of LGT in 144 prokaryotic genomes. Our analysis of its performance, especially with respect to searching and reduction rules, is applicable to computing many related distance measures.
format Text
id pubmed-2614206
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26142062009-02-09 SPR Distance Computation for Unrooted Trees Hickey, Glenn Dehne, Frank Rau-Chaplin, Andrew Blouin, Christian Evol Bioinform Online Original Research The subtree prune and regraft distance (d(SPR)) between phylogenetic trees is important both as a general means of comparing phylogenetic tree topologies as well as a measure of lateral gene transfer (LGT). Although there has been extensive study on the computation of d(SPR) and similar metrics between rooted trees, much less is known about SPR distances for unrooted trees, which often arise in practice when the root is unresolved. We show that unrooted SPR distance computation is NP-Hard and verify which techniques from related work can and cannot be applied. We then present an efficient heuristic algorithm for this problem and benchmark it on a variety of synthetic datasets. Our algorithm computes the exact SPR distance between unrooted tree, and the heuristic element is only with respect to the algorithm’s computation time. Our method is a heuristic version of a fixed parameter tractability (FPT) approach and our experiments indicate that the running time behaves similar to FPT algorithms. For real data sets, our algorithm was able to quickly compute d(SPR) for the majority of trees that were part of a study of LGT in 144 prokaryotic genomes. Our analysis of its performance, especially with respect to searching and reduction rules, is applicable to computing many related distance measures. Libertas Academica 2008-02-09 /pmc/articles/PMC2614206/ /pubmed/19204804 Text en Copyright © 2008 The authors. http://creativecommons.org/licenses/by/3.0 This article is published under the Creative Commons Attribution By licence. For further information go to: http://creativecommons.org/licenses/by/3.0. (http://creativecommons.org/licenses/by/3.0)
spellingShingle Original Research
Hickey, Glenn
Dehne, Frank
Rau-Chaplin, Andrew
Blouin, Christian
SPR Distance Computation for Unrooted Trees
title SPR Distance Computation for Unrooted Trees
title_full SPR Distance Computation for Unrooted Trees
title_fullStr SPR Distance Computation for Unrooted Trees
title_full_unstemmed SPR Distance Computation for Unrooted Trees
title_short SPR Distance Computation for Unrooted Trees
title_sort spr distance computation for unrooted trees
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2614206/
https://www.ncbi.nlm.nih.gov/pubmed/19204804
work_keys_str_mv AT hickeyglenn sprdistancecomputationforunrootedtrees
AT dehnefrank sprdistancecomputationforunrootedtrees
AT rauchaplinandrew sprdistancecomputationforunrootedtrees
AT blouinchristian sprdistancecomputationforunrootedtrees