Cargando…

Disease Module Identification Based on Representation Learning of Complex Networks Integrated From GWAS, eQTL Summaries, and Human Interactome

The study of disease-relevant gene modules is one of the main methods to discover disease pathway and potential drug targets. Recent studies have found that most disease proteins tend to form many separate connected components and scatter across the protein-protein interaction network. However, most...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Tao, Peng, Qidi, Liu, Bo, Liu, Yongzhuang, Wang, Yadong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7218106/
https://www.ncbi.nlm.nih.gov/pubmed/32435638
http://dx.doi.org/10.3389/fbioe.2020.00418
_version_ 1783532728146198528
author Wang, Tao
Peng, Qidi
Liu, Bo
Liu, Yongzhuang
Wang, Yadong
author_facet Wang, Tao
Peng, Qidi
Liu, Bo
Liu, Yongzhuang
Wang, Yadong
author_sort Wang, Tao
collection PubMed
description The study of disease-relevant gene modules is one of the main methods to discover disease pathway and potential drug targets. Recent studies have found that most disease proteins tend to form many separate connected components and scatter across the protein-protein interaction network. However, most of the research on discovering disease modules are biased toward well-studied seed genes, which tend to extend seed genes into a single connected subnetwork. In this paper, we propose N2V-HC, an algorithm framework aiming to unbiasedly discover the scattered disease modules based on deep representation learning of integrated multi-layer biological networks. Our method first predicts disease associated genes based on summary data of Genome-wide Association Studies (GWAS) and expression Quantitative Trait Loci (eQTL) studies, and generates an integrated network on the basis of human interactome. The features of nodes in the network are then extracted by deep representation learning. Hierarchical clustering with dynamic tree cut methods are applied to discover the modules that are enriched with disease associated genes. The evaluation on real networks and simulated networks show that N2V-HC performs better than existing methods in network module discovery. Case studies on Parkinson's disease and Alzheimer's disease, show that N2V-HC can be used to discover biological meaningful modules related to the pathways underlying complex diseases.
format Online
Article
Text
id pubmed-7218106
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-72181062020-05-20 Disease Module Identification Based on Representation Learning of Complex Networks Integrated From GWAS, eQTL Summaries, and Human Interactome Wang, Tao Peng, Qidi Liu, Bo Liu, Yongzhuang Wang, Yadong Front Bioeng Biotechnol Bioengineering and Biotechnology The study of disease-relevant gene modules is one of the main methods to discover disease pathway and potential drug targets. Recent studies have found that most disease proteins tend to form many separate connected components and scatter across the protein-protein interaction network. However, most of the research on discovering disease modules are biased toward well-studied seed genes, which tend to extend seed genes into a single connected subnetwork. In this paper, we propose N2V-HC, an algorithm framework aiming to unbiasedly discover the scattered disease modules based on deep representation learning of integrated multi-layer biological networks. Our method first predicts disease associated genes based on summary data of Genome-wide Association Studies (GWAS) and expression Quantitative Trait Loci (eQTL) studies, and generates an integrated network on the basis of human interactome. The features of nodes in the network are then extracted by deep representation learning. Hierarchical clustering with dynamic tree cut methods are applied to discover the modules that are enriched with disease associated genes. The evaluation on real networks and simulated networks show that N2V-HC performs better than existing methods in network module discovery. Case studies on Parkinson's disease and Alzheimer's disease, show that N2V-HC can be used to discover biological meaningful modules related to the pathways underlying complex diseases. Frontiers Media S.A. 2020-05-06 /pmc/articles/PMC7218106/ /pubmed/32435638 http://dx.doi.org/10.3389/fbioe.2020.00418 Text en Copyright © 2020 Wang, Peng, Liu, Liu and Wang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Wang, Tao
Peng, Qidi
Liu, Bo
Liu, Yongzhuang
Wang, Yadong
Disease Module Identification Based on Representation Learning of Complex Networks Integrated From GWAS, eQTL Summaries, and Human Interactome
title Disease Module Identification Based on Representation Learning of Complex Networks Integrated From GWAS, eQTL Summaries, and Human Interactome
title_full Disease Module Identification Based on Representation Learning of Complex Networks Integrated From GWAS, eQTL Summaries, and Human Interactome
title_fullStr Disease Module Identification Based on Representation Learning of Complex Networks Integrated From GWAS, eQTL Summaries, and Human Interactome
title_full_unstemmed Disease Module Identification Based on Representation Learning of Complex Networks Integrated From GWAS, eQTL Summaries, and Human Interactome
title_short Disease Module Identification Based on Representation Learning of Complex Networks Integrated From GWAS, eQTL Summaries, and Human Interactome
title_sort disease module identification based on representation learning of complex networks integrated from gwas, eqtl summaries, and human interactome
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7218106/
https://www.ncbi.nlm.nih.gov/pubmed/32435638
http://dx.doi.org/10.3389/fbioe.2020.00418
work_keys_str_mv AT wangtao diseasemoduleidentificationbasedonrepresentationlearningofcomplexnetworksintegratedfromgwaseqtlsummariesandhumaninteractome
AT pengqidi diseasemoduleidentificationbasedonrepresentationlearningofcomplexnetworksintegratedfromgwaseqtlsummariesandhumaninteractome
AT liubo diseasemoduleidentificationbasedonrepresentationlearningofcomplexnetworksintegratedfromgwaseqtlsummariesandhumaninteractome
AT liuyongzhuang diseasemoduleidentificationbasedonrepresentationlearningofcomplexnetworksintegratedfromgwaseqtlsummariesandhumaninteractome
AT wangyadong diseasemoduleidentificationbasedonrepresentationlearningofcomplexnetworksintegratedfromgwaseqtlsummariesandhumaninteractome