Cargando…

GRAPES-DD: exploiting decision diagrams for index-driven search in biological graph databases

BACKGROUND: Graphs are mathematical structures widely used for expressing relationships among elements when representing biomedical and biological information. On top of these representations, several analyses are performed. A common task is the search of one substructure within one graph, called ta...

Descripción completa

Detalles Bibliográficos
Autores principales: Licheri, Nicola, Bonnici, Vincenzo, Beccuti, Marco, Giugno, Rosalba
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8061067/
https://www.ncbi.nlm.nih.gov/pubmed/33888059
http://dx.doi.org/10.1186/s12859-021-04129-0
_version_ 1783681490643582976
author Licheri, Nicola
Bonnici, Vincenzo
Beccuti, Marco
Giugno, Rosalba
author_facet Licheri, Nicola
Bonnici, Vincenzo
Beccuti, Marco
Giugno, Rosalba
author_sort Licheri, Nicola
collection PubMed
description BACKGROUND: Graphs are mathematical structures widely used for expressing relationships among elements when representing biomedical and biological information. On top of these representations, several analyses are performed. A common task is the search of one substructure within one graph, called target. The problem is referred to as one-to-one subgraph search, and it is known to be NP-complete. Heuristics and indexing techniques can be applied to facilitate the search. Indexing techniques are also exploited in the context of searching in a collection of target graphs, referred to as one-to-many subgraph problem. Filter-and-verification methods that use indexing approaches provide a fast pruning of target graphs or parts of them that do not contain the query. The expensive verification phase is then performed only on the subset of promising targets. Indexing strategies extract graph features at a sufficient granularity level for performing a powerful filtering step. Features are memorized in data structures allowing an efficient access. Indexing size, querying time and filtering power are key points for the development of efficient subgraph searching solutions. RESULTS: An existing approach, GRAPES, has been shown to have good performance in terms of speed-up for both one-to-one and one-to-many cases. However, it suffers in the size of the built index. For this reason, we propose GRAPES-DD, a modified version of GRAPES in which the indexing structure has been replaced with a Decision Diagram. Decision Diagrams are a broad class of data structures widely used to encode and manipulate functions efficiently. Experiments on biomedical structures and synthetic graphs have confirmed our expectation showing that GRAPES-DD has substantially reduced the memory utilization compared to GRAPES without worsening the searching time. CONCLUSION: The use of Decision Diagrams for searching in biochemical and biological graphs is completely new and potentially promising thanks to their ability to encode compactly sets by exploiting their structure and regularity, and to manipulate entire sets of elements at once, instead of exploring each single element explicitly. Search strategies based on Decision Diagram makes the indexing for biochemical graphs, and not only, more affordable allowing us to potentially deal with huge and ever growing collections of biochemical and biological structures.
format Online
Article
Text
id pubmed-8061067
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-80610672021-04-22 GRAPES-DD: exploiting decision diagrams for index-driven search in biological graph databases Licheri, Nicola Bonnici, Vincenzo Beccuti, Marco Giugno, Rosalba BMC Bioinformatics Methodology Article BACKGROUND: Graphs are mathematical structures widely used for expressing relationships among elements when representing biomedical and biological information. On top of these representations, several analyses are performed. A common task is the search of one substructure within one graph, called target. The problem is referred to as one-to-one subgraph search, and it is known to be NP-complete. Heuristics and indexing techniques can be applied to facilitate the search. Indexing techniques are also exploited in the context of searching in a collection of target graphs, referred to as one-to-many subgraph problem. Filter-and-verification methods that use indexing approaches provide a fast pruning of target graphs or parts of them that do not contain the query. The expensive verification phase is then performed only on the subset of promising targets. Indexing strategies extract graph features at a sufficient granularity level for performing a powerful filtering step. Features are memorized in data structures allowing an efficient access. Indexing size, querying time and filtering power are key points for the development of efficient subgraph searching solutions. RESULTS: An existing approach, GRAPES, has been shown to have good performance in terms of speed-up for both one-to-one and one-to-many cases. However, it suffers in the size of the built index. For this reason, we propose GRAPES-DD, a modified version of GRAPES in which the indexing structure has been replaced with a Decision Diagram. Decision Diagrams are a broad class of data structures widely used to encode and manipulate functions efficiently. Experiments on biomedical structures and synthetic graphs have confirmed our expectation showing that GRAPES-DD has substantially reduced the memory utilization compared to GRAPES without worsening the searching time. CONCLUSION: The use of Decision Diagrams for searching in biochemical and biological graphs is completely new and potentially promising thanks to their ability to encode compactly sets by exploiting their structure and regularity, and to manipulate entire sets of elements at once, instead of exploring each single element explicitly. Search strategies based on Decision Diagram makes the indexing for biochemical graphs, and not only, more affordable allowing us to potentially deal with huge and ever growing collections of biochemical and biological structures. BioMed Central 2021-04-22 /pmc/articles/PMC8061067/ /pubmed/33888059 http://dx.doi.org/10.1186/s12859-021-04129-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Licheri, Nicola
Bonnici, Vincenzo
Beccuti, Marco
Giugno, Rosalba
GRAPES-DD: exploiting decision diagrams for index-driven search in biological graph databases
title GRAPES-DD: exploiting decision diagrams for index-driven search in biological graph databases
title_full GRAPES-DD: exploiting decision diagrams for index-driven search in biological graph databases
title_fullStr GRAPES-DD: exploiting decision diagrams for index-driven search in biological graph databases
title_full_unstemmed GRAPES-DD: exploiting decision diagrams for index-driven search in biological graph databases
title_short GRAPES-DD: exploiting decision diagrams for index-driven search in biological graph databases
title_sort grapes-dd: exploiting decision diagrams for index-driven search in biological graph databases
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8061067/
https://www.ncbi.nlm.nih.gov/pubmed/33888059
http://dx.doi.org/10.1186/s12859-021-04129-0
work_keys_str_mv AT licherinicola grapesddexploitingdecisiondiagramsforindexdrivensearchinbiologicalgraphdatabases
AT bonnicivincenzo grapesddexploitingdecisiondiagramsforindexdrivensearchinbiologicalgraphdatabases
AT beccutimarco grapesddexploitingdecisiondiagramsforindexdrivensearchinbiologicalgraphdatabases
AT giugnorosalba grapesddexploitingdecisiondiagramsforindexdrivensearchinbiologicalgraphdatabases