Cargando…

Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification

Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death in the United States. COPD represents one of many areas of research where identifying complex pathways and networks of interacting biomarkers is an important avenue toward studying disease progression and potentially...

Descripción completa

Detalles Bibliográficos
Autores principales: Abdel-Hafiz, Mohamed, Najafi, Mesbah, Helmi, Shahab, Pratte, Katherine A., Zhuang, Yonghua, Liu, Weixuan, Kechris, Katerina J., Bowler, Russell P., Lange, Leslie, Banaei-Kashani, Farnoush
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9256965/
https://www.ncbi.nlm.nih.gov/pubmed/35811829
http://dx.doi.org/10.3389/fdata.2022.894632
_version_ 1784741230526070784
author Abdel-Hafiz, Mohamed
Najafi, Mesbah
Helmi, Shahab
Pratte, Katherine A.
Zhuang, Yonghua
Liu, Weixuan
Kechris, Katerina J.
Bowler, Russell P.
Lange, Leslie
Banaei-Kashani, Farnoush
author_facet Abdel-Hafiz, Mohamed
Najafi, Mesbah
Helmi, Shahab
Pratte, Katherine A.
Zhuang, Yonghua
Liu, Weixuan
Kechris, Katerina J.
Bowler, Russell P.
Lange, Leslie
Banaei-Kashani, Farnoush
author_sort Abdel-Hafiz, Mohamed
collection PubMed
description Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death in the United States. COPD represents one of many areas of research where identifying complex pathways and networks of interacting biomarkers is an important avenue toward studying disease progression and potentially discovering cures. Recently, sparse multiple canonical correlation network analysis (SmCCNet) was developed to identify complex relationships between omics associated with a disease phenotype, such as lung function. SmCCNet uses two sets of omics datasets and an associated output phenotypes to generate a multi-omics graph, which can then be used to explore relationships between omics in the context of a disease. Detecting significant subgraphs within this multi-omics network, i.e., subgraphs which exhibit high correlation to a disease phenotype and high inter-connectivity, can help clinicians identify complex biological relationships involved in disease progression. The current approach to identifying significant subgraphs relies on hierarchical clustering, which can be used to inform clinicians about important pathways involved in the disease or phenotype of interest. The reliance on a hierarchical clustering approach can hinder subgraph quality by biasing toward finding more compact subgraphs and removing larger significant subgraphs. This study aims to introduce new significant subgraph detection techniques. In particular, we introduce two subgraph detection methods, dubbed Correlated PageRank and Correlated Louvain, by extending the Personalized PageRank Clustering and Louvain algorithms, as well as a hybrid approach combining the two proposed methods, and compare them to the hierarchical method currently in use. The proposed methods show significant improvement in the quality of the subgraphs produced when compared to the current state of the art.
format Online
Article
Text
id pubmed-9256965
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92569652022-07-07 Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification Abdel-Hafiz, Mohamed Najafi, Mesbah Helmi, Shahab Pratte, Katherine A. Zhuang, Yonghua Liu, Weixuan Kechris, Katerina J. Bowler, Russell P. Lange, Leslie Banaei-Kashani, Farnoush Front Big Data Big Data Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death in the United States. COPD represents one of many areas of research where identifying complex pathways and networks of interacting biomarkers is an important avenue toward studying disease progression and potentially discovering cures. Recently, sparse multiple canonical correlation network analysis (SmCCNet) was developed to identify complex relationships between omics associated with a disease phenotype, such as lung function. SmCCNet uses two sets of omics datasets and an associated output phenotypes to generate a multi-omics graph, which can then be used to explore relationships between omics in the context of a disease. Detecting significant subgraphs within this multi-omics network, i.e., subgraphs which exhibit high correlation to a disease phenotype and high inter-connectivity, can help clinicians identify complex biological relationships involved in disease progression. The current approach to identifying significant subgraphs relies on hierarchical clustering, which can be used to inform clinicians about important pathways involved in the disease or phenotype of interest. The reliance on a hierarchical clustering approach can hinder subgraph quality by biasing toward finding more compact subgraphs and removing larger significant subgraphs. This study aims to introduce new significant subgraph detection techniques. In particular, we introduce two subgraph detection methods, dubbed Correlated PageRank and Correlated Louvain, by extending the Personalized PageRank Clustering and Louvain algorithms, as well as a hybrid approach combining the two proposed methods, and compare them to the hierarchical method currently in use. The proposed methods show significant improvement in the quality of the subgraphs produced when compared to the current state of the art. Frontiers Media S.A. 2022-06-22 /pmc/articles/PMC9256965/ /pubmed/35811829 http://dx.doi.org/10.3389/fdata.2022.894632 Text en Copyright © 2022 Abdel-Hafiz, Najafi, Helmi, Pratte, Zhuang, Liu, Kechris, Bowler, Lange and Banaei-Kashani. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Abdel-Hafiz, Mohamed
Najafi, Mesbah
Helmi, Shahab
Pratte, Katherine A.
Zhuang, Yonghua
Liu, Weixuan
Kechris, Katerina J.
Bowler, Russell P.
Lange, Leslie
Banaei-Kashani, Farnoush
Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification
title Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification
title_full Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification
title_fullStr Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification
title_full_unstemmed Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification
title_short Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification
title_sort significant subgraph detection in multi-omics networks for disease pathway identification
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9256965/
https://www.ncbi.nlm.nih.gov/pubmed/35811829
http://dx.doi.org/10.3389/fdata.2022.894632
work_keys_str_mv AT abdelhafizmohamed significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification
AT najafimesbah significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification
AT helmishahab significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification
AT prattekatherinea significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification
AT zhuangyonghua significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification
AT liuweixuan significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification
AT kechriskaterinaj significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification
AT bowlerrussellp significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification
AT langeleslie significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification
AT banaeikashanifarnoush significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification