Cargando…
A linear delay algorithm for enumerating all connected induced subgraphs
BACKGROUND: Real biological and social data is increasingly being represented as graphs. Pattern-mining-based graph learning and analysis techniques report meaningful biological subnetworks that elucidate important interactions among entities. At the backbone of these algorithms is the enumeration o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6584512/ https://www.ncbi.nlm.nih.gov/pubmed/31216984 http://dx.doi.org/10.1186/s12859-019-2837-y |
_version_ | 1783428524227428352 |
---|---|
author | Alokshiya, Mohammed Salem, Saeed Abed, Fidaa |
author_facet | Alokshiya, Mohammed Salem, Saeed Abed, Fidaa |
author_sort | Alokshiya, Mohammed |
collection | PubMed |
description | BACKGROUND: Real biological and social data is increasingly being represented as graphs. Pattern-mining-based graph learning and analysis techniques report meaningful biological subnetworks that elucidate important interactions among entities. At the backbone of these algorithms is the enumeration of pattern space. RESULTS: We propose an efficient algorithm for enumerating all connected induced subgraphs of an undirected graph. Building on this enumeration approach, we propose an algorithm for mining all maximal cohesive subgraphs that integrates vertices’ attributes with subgraph enumeration. To efficiently mine all maximal cohesive subgraphs, we propose two pruning techniques that remove futile search nodes in the enumeration tree. CONCLUSIONS: Experiments on synthetic and real graphs show the effectiveness of the proposed algorithm and the pruning techniques. On enumerating all connected induced subgraphs, our algorithm is several times faster than existing approaches. On dense graphs, the proposed approach is at least an order of magnitude faster than the best existing algorithm. Experiments on protein-protein interaction network with cancer gene dysregulation profile show that the reported cohesive subnetworks are biologically interesting. |
format | Online Article Text |
id | pubmed-6584512 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-65845122019-06-26 A linear delay algorithm for enumerating all connected induced subgraphs Alokshiya, Mohammed Salem, Saeed Abed, Fidaa BMC Bioinformatics Research BACKGROUND: Real biological and social data is increasingly being represented as graphs. Pattern-mining-based graph learning and analysis techniques report meaningful biological subnetworks that elucidate important interactions among entities. At the backbone of these algorithms is the enumeration of pattern space. RESULTS: We propose an efficient algorithm for enumerating all connected induced subgraphs of an undirected graph. Building on this enumeration approach, we propose an algorithm for mining all maximal cohesive subgraphs that integrates vertices’ attributes with subgraph enumeration. To efficiently mine all maximal cohesive subgraphs, we propose two pruning techniques that remove futile search nodes in the enumeration tree. CONCLUSIONS: Experiments on synthetic and real graphs show the effectiveness of the proposed algorithm and the pruning techniques. On enumerating all connected induced subgraphs, our algorithm is several times faster than existing approaches. On dense graphs, the proposed approach is at least an order of magnitude faster than the best existing algorithm. Experiments on protein-protein interaction network with cancer gene dysregulation profile show that the reported cohesive subnetworks are biologically interesting. BioMed Central 2019-06-20 /pmc/articles/PMC6584512/ /pubmed/31216984 http://dx.doi.org/10.1186/s12859-019-2837-y Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Alokshiya, Mohammed Salem, Saeed Abed, Fidaa A linear delay algorithm for enumerating all connected induced subgraphs |
title | A linear delay algorithm for enumerating all connected induced subgraphs |
title_full | A linear delay algorithm for enumerating all connected induced subgraphs |
title_fullStr | A linear delay algorithm for enumerating all connected induced subgraphs |
title_full_unstemmed | A linear delay algorithm for enumerating all connected induced subgraphs |
title_short | A linear delay algorithm for enumerating all connected induced subgraphs |
title_sort | linear delay algorithm for enumerating all connected induced subgraphs |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6584512/ https://www.ncbi.nlm.nih.gov/pubmed/31216984 http://dx.doi.org/10.1186/s12859-019-2837-y |
work_keys_str_mv | AT alokshiyamohammed alineardelayalgorithmforenumeratingallconnectedinducedsubgraphs AT salemsaeed alineardelayalgorithmforenumeratingallconnectedinducedsubgraphs AT abedfidaa alineardelayalgorithmforenumeratingallconnectedinducedsubgraphs AT alokshiyamohammed lineardelayalgorithmforenumeratingallconnectedinducedsubgraphs AT salemsaeed lineardelayalgorithmforenumeratingallconnectedinducedsubgraphs AT abedfidaa lineardelayalgorithmforenumeratingallconnectedinducedsubgraphs |