Cargando…

SING: Subgraph search In Non-homogeneous Graphs

BACKGROUND: Finding the subgraphs of a graph database that are isomorphic to a given query graph has practical applications in several fields, from cheminformatics to image understanding. Since subgraph isomorphism is a computationally hard problem, indexing techniques have been intensively exploite...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Natale, Raffaele, Ferro, Alfredo, Giugno, Rosalba, Mongiovì, Misael, Pulvirenti, Alfredo, Shasha, Dennis
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2850364/
https://www.ncbi.nlm.nih.gov/pubmed/20170516
http://dx.doi.org/10.1186/1471-2105-11-96
_version_ 1782179778121433088
author Di Natale, Raffaele
Ferro, Alfredo
Giugno, Rosalba
Mongiovì, Misael
Pulvirenti, Alfredo
Shasha, Dennis
author_facet Di Natale, Raffaele
Ferro, Alfredo
Giugno, Rosalba
Mongiovì, Misael
Pulvirenti, Alfredo
Shasha, Dennis
author_sort Di Natale, Raffaele
collection PubMed
description BACKGROUND: Finding the subgraphs of a graph database that are isomorphic to a given query graph has practical applications in several fields, from cheminformatics to image understanding. Since subgraph isomorphism is a computationally hard problem, indexing techniques have been intensively exploited to speed up the process. Such systems filter out those graphs which cannot contain the query, and apply a subgraph isomorphism algorithm to each residual candidate graph. The applicability of such systems is limited to databases of small graphs, because their filtering power degrades on large graphs. RESULTS: In this paper, SING (Subgraph search In Non-homogeneous Graphs), a novel indexing system able to cope with large graphs, is presented. The method uses the notion of feature, which can be a small subgraph, subtree or path. Each graph in the database is annotated with the set of all its features. The key point is to make use of feature locality information. This idea is used to both improve the filtering performance and speed up the subgraph isomorphism task. CONCLUSIONS: Extensive tests on chemical compounds, biological networks and synthetic graphs show that the proposed system outperforms the most popular systems in query time over databases of medium and large graphs. Other specific tests show that the proposed system is effective for single large graphs.
format Text
id pubmed-2850364
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28503642010-04-07 SING: Subgraph search In Non-homogeneous Graphs Di Natale, Raffaele Ferro, Alfredo Giugno, Rosalba Mongiovì, Misael Pulvirenti, Alfredo Shasha, Dennis BMC Bioinformatics Methodology article BACKGROUND: Finding the subgraphs of a graph database that are isomorphic to a given query graph has practical applications in several fields, from cheminformatics to image understanding. Since subgraph isomorphism is a computationally hard problem, indexing techniques have been intensively exploited to speed up the process. Such systems filter out those graphs which cannot contain the query, and apply a subgraph isomorphism algorithm to each residual candidate graph. The applicability of such systems is limited to databases of small graphs, because their filtering power degrades on large graphs. RESULTS: In this paper, SING (Subgraph search In Non-homogeneous Graphs), a novel indexing system able to cope with large graphs, is presented. The method uses the notion of feature, which can be a small subgraph, subtree or path. Each graph in the database is annotated with the set of all its features. The key point is to make use of feature locality information. This idea is used to both improve the filtering performance and speed up the subgraph isomorphism task. CONCLUSIONS: Extensive tests on chemical compounds, biological networks and synthetic graphs show that the proposed system outperforms the most popular systems in query time over databases of medium and large graphs. Other specific tests show that the proposed system is effective for single large graphs. BioMed Central 2010-02-19 /pmc/articles/PMC2850364/ /pubmed/20170516 http://dx.doi.org/10.1186/1471-2105-11-96 Text en Copyright ©2010 Di Natale et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Di Natale, Raffaele
Ferro, Alfredo
Giugno, Rosalba
Mongiovì, Misael
Pulvirenti, Alfredo
Shasha, Dennis
SING: Subgraph search In Non-homogeneous Graphs
title SING: Subgraph search In Non-homogeneous Graphs
title_full SING: Subgraph search In Non-homogeneous Graphs
title_fullStr SING: Subgraph search In Non-homogeneous Graphs
title_full_unstemmed SING: Subgraph search In Non-homogeneous Graphs
title_short SING: Subgraph search In Non-homogeneous Graphs
title_sort sing: subgraph search in non-homogeneous graphs
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2850364/
https://www.ncbi.nlm.nih.gov/pubmed/20170516
http://dx.doi.org/10.1186/1471-2105-11-96
work_keys_str_mv AT dinataleraffaele singsubgraphsearchinnonhomogeneousgraphs
AT ferroalfredo singsubgraphsearchinnonhomogeneousgraphs
AT giugnorosalba singsubgraphsearchinnonhomogeneousgraphs
AT mongiovimisael singsubgraphsearchinnonhomogeneousgraphs
AT pulvirentialfredo singsubgraphsearchinnonhomogeneousgraphs
AT shashadennis singsubgraphsearchinnonhomogeneousgraphs