Cargando…

Fast Structural Search in Phylogenetic Databases

As the size of phylogenetic databases grows, the need for efficiently searching these databases arises. Thanks to previous and ongoing research, searching by attribute value and by text has become commonplace in these databases. However, searching by topological or physical structure, especially for...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jason T. L., Shan, Huiyuan, Shasha, Dennis, Piel, William H.
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2658875/
https://www.ncbi.nlm.nih.gov/pubmed/19325851
_version_ 1782165657571295232
author Wang, Jason T. L.
Shan, Huiyuan
Shasha, Dennis
Piel, William H.
author_facet Wang, Jason T. L.
Shan, Huiyuan
Shasha, Dennis
Piel, William H.
author_sort Wang, Jason T. L.
collection PubMed
description As the size of phylogenetic databases grows, the need for efficiently searching these databases arises. Thanks to previous and ongoing research, searching by attribute value and by text has become commonplace in these databases. However, searching by topological or physical structure, especially for large databases and especially for approximate matches, is still an art. We propose structural search techniques that, given a query or pattern tree P and a database of phylogenies D, find trees in D that are sufficiently close to P. The “closeness” is a measure of the topological relationships in P that are found to be the same or similar in a tree D in D. We develop a filtering technique that accelerates searches and present algorithms for rooted and unrooted trees where the trees can be weighted or unweighted. Experimental results on comparing the similarity measure with existing tree metrics and on evaluating the efficiency of the search techniques demonstrate that the proposed approach is promising.
format Text
id pubmed-2658875
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26588752009-03-25 Fast Structural Search in Phylogenetic Databases Wang, Jason T. L. Shan, Huiyuan Shasha, Dennis Piel, William H. Evol Bioinform Online Original Research As the size of phylogenetic databases grows, the need for efficiently searching these databases arises. Thanks to previous and ongoing research, searching by attribute value and by text has become commonplace in these databases. However, searching by topological or physical structure, especially for large databases and especially for approximate matches, is still an art. We propose structural search techniques that, given a query or pattern tree P and a database of phylogenies D, find trees in D that are sufficiently close to P. The “closeness” is a measure of the topological relationships in P that are found to be the same or similar in a tree D in D. We develop a filtering technique that accelerates searches and present algorithms for rooted and unrooted trees where the trees can be weighted or unweighted. Experimental results on comparing the similarity measure with existing tree metrics and on evaluating the efficiency of the search techniques demonstrate that the proposed approach is promising. Libertas Academica 2007-02-20 /pmc/articles/PMC2658875/ /pubmed/19325851 Text en Copyright © 2005 The authors. http://creativecommons.org/licenses/by/3.0 This article is published under the Creative Commons Attribution By licence. For further information go to: http://creativecommons.org/licenses/by/3.0. (http://creativecommons.org/licenses/by/3.0)
spellingShingle Original Research
Wang, Jason T. L.
Shan, Huiyuan
Shasha, Dennis
Piel, William H.
Fast Structural Search in Phylogenetic Databases
title Fast Structural Search in Phylogenetic Databases
title_full Fast Structural Search in Phylogenetic Databases
title_fullStr Fast Structural Search in Phylogenetic Databases
title_full_unstemmed Fast Structural Search in Phylogenetic Databases
title_short Fast Structural Search in Phylogenetic Databases
title_sort fast structural search in phylogenetic databases
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2658875/
https://www.ncbi.nlm.nih.gov/pubmed/19325851
work_keys_str_mv AT wangjasontl faststructuralsearchinphylogeneticdatabases
AT shanhuiyuan faststructuralsearchinphylogeneticdatabases
AT shashadennis faststructuralsearchinphylogeneticdatabases
AT pielwilliamh faststructuralsearchinphylogeneticdatabases