Cargando…
Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification
Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death in the United States. COPD represents one of many areas of research where identifying complex pathways and networks of interacting biomarkers is an important avenue toward studying disease progression and potentially...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9256965/ https://www.ncbi.nlm.nih.gov/pubmed/35811829 http://dx.doi.org/10.3389/fdata.2022.894632 |
_version_ | 1784741230526070784 |
---|---|
author | Abdel-Hafiz, Mohamed Najafi, Mesbah Helmi, Shahab Pratte, Katherine A. Zhuang, Yonghua Liu, Weixuan Kechris, Katerina J. Bowler, Russell P. Lange, Leslie Banaei-Kashani, Farnoush |
author_facet | Abdel-Hafiz, Mohamed Najafi, Mesbah Helmi, Shahab Pratte, Katherine A. Zhuang, Yonghua Liu, Weixuan Kechris, Katerina J. Bowler, Russell P. Lange, Leslie Banaei-Kashani, Farnoush |
author_sort | Abdel-Hafiz, Mohamed |
collection | PubMed |
description | Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death in the United States. COPD represents one of many areas of research where identifying complex pathways and networks of interacting biomarkers is an important avenue toward studying disease progression and potentially discovering cures. Recently, sparse multiple canonical correlation network analysis (SmCCNet) was developed to identify complex relationships between omics associated with a disease phenotype, such as lung function. SmCCNet uses two sets of omics datasets and an associated output phenotypes to generate a multi-omics graph, which can then be used to explore relationships between omics in the context of a disease. Detecting significant subgraphs within this multi-omics network, i.e., subgraphs which exhibit high correlation to a disease phenotype and high inter-connectivity, can help clinicians identify complex biological relationships involved in disease progression. The current approach to identifying significant subgraphs relies on hierarchical clustering, which can be used to inform clinicians about important pathways involved in the disease or phenotype of interest. The reliance on a hierarchical clustering approach can hinder subgraph quality by biasing toward finding more compact subgraphs and removing larger significant subgraphs. This study aims to introduce new significant subgraph detection techniques. In particular, we introduce two subgraph detection methods, dubbed Correlated PageRank and Correlated Louvain, by extending the Personalized PageRank Clustering and Louvain algorithms, as well as a hybrid approach combining the two proposed methods, and compare them to the hierarchical method currently in use. The proposed methods show significant improvement in the quality of the subgraphs produced when compared to the current state of the art. |
format | Online Article Text |
id | pubmed-9256965 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92569652022-07-07 Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification Abdel-Hafiz, Mohamed Najafi, Mesbah Helmi, Shahab Pratte, Katherine A. Zhuang, Yonghua Liu, Weixuan Kechris, Katerina J. Bowler, Russell P. Lange, Leslie Banaei-Kashani, Farnoush Front Big Data Big Data Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death in the United States. COPD represents one of many areas of research where identifying complex pathways and networks of interacting biomarkers is an important avenue toward studying disease progression and potentially discovering cures. Recently, sparse multiple canonical correlation network analysis (SmCCNet) was developed to identify complex relationships between omics associated with a disease phenotype, such as lung function. SmCCNet uses two sets of omics datasets and an associated output phenotypes to generate a multi-omics graph, which can then be used to explore relationships between omics in the context of a disease. Detecting significant subgraphs within this multi-omics network, i.e., subgraphs which exhibit high correlation to a disease phenotype and high inter-connectivity, can help clinicians identify complex biological relationships involved in disease progression. The current approach to identifying significant subgraphs relies on hierarchical clustering, which can be used to inform clinicians about important pathways involved in the disease or phenotype of interest. The reliance on a hierarchical clustering approach can hinder subgraph quality by biasing toward finding more compact subgraphs and removing larger significant subgraphs. This study aims to introduce new significant subgraph detection techniques. In particular, we introduce two subgraph detection methods, dubbed Correlated PageRank and Correlated Louvain, by extending the Personalized PageRank Clustering and Louvain algorithms, as well as a hybrid approach combining the two proposed methods, and compare them to the hierarchical method currently in use. The proposed methods show significant improvement in the quality of the subgraphs produced when compared to the current state of the art. Frontiers Media S.A. 2022-06-22 /pmc/articles/PMC9256965/ /pubmed/35811829 http://dx.doi.org/10.3389/fdata.2022.894632 Text en Copyright © 2022 Abdel-Hafiz, Najafi, Helmi, Pratte, Zhuang, Liu, Kechris, Bowler, Lange and Banaei-Kashani. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Big Data Abdel-Hafiz, Mohamed Najafi, Mesbah Helmi, Shahab Pratte, Katherine A. Zhuang, Yonghua Liu, Weixuan Kechris, Katerina J. Bowler, Russell P. Lange, Leslie Banaei-Kashani, Farnoush Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification |
title | Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification |
title_full | Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification |
title_fullStr | Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification |
title_full_unstemmed | Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification |
title_short | Significant Subgraph Detection in Multi-omics Networks for Disease Pathway Identification |
title_sort | significant subgraph detection in multi-omics networks for disease pathway identification |
topic | Big Data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9256965/ https://www.ncbi.nlm.nih.gov/pubmed/35811829 http://dx.doi.org/10.3389/fdata.2022.894632 |
work_keys_str_mv | AT abdelhafizmohamed significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification AT najafimesbah significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification AT helmishahab significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification AT prattekatherinea significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification AT zhuangyonghua significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification AT liuweixuan significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification AT kechriskaterinaj significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification AT bowlerrussellp significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification AT langeleslie significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification AT banaeikashanifarnoush significantsubgraphdetectioninmultiomicsnetworksfordiseasepathwayidentification |