Cargando…

Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships

Leveraging molecular networks to discover disease-relevant modules is a long-standing challenge. With the accumulation of interactomes, there is a pressing need for powerful computational approaches to handle the inevitable noise and context-specific nature of biological networks. Here, we introduce...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yi, Sun, Zijun, He, Qiushun, Li, Jiwei, Ni, Ming, Yang, Meng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9868676/
https://www.ncbi.nlm.nih.gov/pubmed/36699743
http://dx.doi.org/10.1016/j.patter.2022.100651
_version_ 1784876593860050944
author Wang, Yi
Sun, Zijun
He, Qiushun
Li, Jiwei
Ni, Ming
Yang, Meng
author_facet Wang, Yi
Sun, Zijun
He, Qiushun
Li, Jiwei
Ni, Ming
Yang, Meng
author_sort Wang, Yi
collection PubMed
description Leveraging molecular networks to discover disease-relevant modules is a long-standing challenge. With the accumulation of interactomes, there is a pressing need for powerful computational approaches to handle the inevitable noise and context-specific nature of biological networks. Here, we introduce Graphene, a two-step self-supervised representation learning framework tailored to concisely integrate multiple molecular networks and adapted to gene functional analysis via downstream re-training. In practice, we first leverage GNN (graph neural network) pre-training techniques to obtain initial node embeddings followed by re-training Graphene using a graph attention architecture, achieving superior performance over competing methods for pathway gene recovery, disease gene reprioritization, and comorbidity prediction. Graphene successfully recapitulates tissue-specific gene expression across disease spectrum and demonstrates shared heritability of common mental disorders. Graphene can be updated with new interactomes or other omics features. Graphene holds promise to decipher gene function under network context and refine GWAS (genome-wide association study) hits and offers mechanistic insights via decoding diseases from genome to networks to phenotypes.
format Online
Article
Text
id pubmed-9868676
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-98686762023-01-24 Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships Wang, Yi Sun, Zijun He, Qiushun Li, Jiwei Ni, Ming Yang, Meng Patterns (N Y) Article Leveraging molecular networks to discover disease-relevant modules is a long-standing challenge. With the accumulation of interactomes, there is a pressing need for powerful computational approaches to handle the inevitable noise and context-specific nature of biological networks. Here, we introduce Graphene, a two-step self-supervised representation learning framework tailored to concisely integrate multiple molecular networks and adapted to gene functional analysis via downstream re-training. In practice, we first leverage GNN (graph neural network) pre-training techniques to obtain initial node embeddings followed by re-training Graphene using a graph attention architecture, achieving superior performance over competing methods for pathway gene recovery, disease gene reprioritization, and comorbidity prediction. Graphene successfully recapitulates tissue-specific gene expression across disease spectrum and demonstrates shared heritability of common mental disorders. Graphene can be updated with new interactomes or other omics features. Graphene holds promise to decipher gene function under network context and refine GWAS (genome-wide association study) hits and offers mechanistic insights via decoding diseases from genome to networks to phenotypes. Elsevier 2022-12-06 /pmc/articles/PMC9868676/ /pubmed/36699743 http://dx.doi.org/10.1016/j.patter.2022.100651 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Wang, Yi
Sun, Zijun
He, Qiushun
Li, Jiwei
Ni, Ming
Yang, Meng
Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships
title Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships
title_full Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships
title_fullStr Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships
title_full_unstemmed Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships
title_short Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships
title_sort self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9868676/
https://www.ncbi.nlm.nih.gov/pubmed/36699743
http://dx.doi.org/10.1016/j.patter.2022.100651
work_keys_str_mv AT wangyi selfsupervisedgraphrepresentationlearningintegratesmultiplemolecularnetworksanddecodesgenediseaserelationships
AT sunzijun selfsupervisedgraphrepresentationlearningintegratesmultiplemolecularnetworksanddecodesgenediseaserelationships
AT heqiushun selfsupervisedgraphrepresentationlearningintegratesmultiplemolecularnetworksanddecodesgenediseaserelationships
AT lijiwei selfsupervisedgraphrepresentationlearningintegratesmultiplemolecularnetworksanddecodesgenediseaserelationships
AT niming selfsupervisedgraphrepresentationlearningintegratesmultiplemolecularnetworksanddecodesgenediseaserelationships
AT yangmeng selfsupervisedgraphrepresentationlearningintegratesmultiplemolecularnetworksanddecodesgenediseaserelationships