Cargando…

Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks

Discovering gene regulatory relationships and reconstructing gene regulatory networks (GRN) based on gene expression data is a classical, long-standing computational challenge in bioinformatics. Computationally inferring a possible regulatory relationship between two genes can be formulated as a lin...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Juexin, Ma, Anjun, Ma, Qin, Xu, Dong, Joshi, Trupti
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7677691/
https://www.ncbi.nlm.nih.gov/pubmed/33294129
http://dx.doi.org/10.1016/j.csbj.2020.10.022
_version_ 1783612026777501696
author Wang, Juexin
Ma, Anjun
Ma, Qin
Xu, Dong
Joshi, Trupti
author_facet Wang, Juexin
Ma, Anjun
Ma, Qin
Xu, Dong
Joshi, Trupti
author_sort Wang, Juexin
collection PubMed
description Discovering gene regulatory relationships and reconstructing gene regulatory networks (GRN) based on gene expression data is a classical, long-standing computational challenge in bioinformatics. Computationally inferring a possible regulatory relationship between two genes can be formulated as a link prediction problem between two nodes in a graph. Graph neural network (GNN) provides an opportunity to construct GRN by integrating topological neighbor propagation through the whole gene network. We propose an end-to-end gene regulatory graph neural network (GRGNN) approach to reconstruct GRNs from scratch utilizing the gene expression data, in both a supervised and a semi-supervised framework. To get better inductive generalization capability, GRN inference is formulated as a graph classification problem, to distinguish whether a subgraph centered at two nodes contains the link between the two nodes. A linked pair between a transcription factor (TF) and a target gene, and their neighbors are labeled as a positive subgraph, while an unlinked TF and target gene pair and their neighbors are labeled as a negative subgraph. A GNN model is constructed with node features from both explicit gene expression and graph embedding. We demonstrate a noisy starting graph structure built from partial information, such as Pearson’s correlation coefficient and mutual information can help guide the GRN inference through an appropriate ensemble technique. Furthermore, a semi-supervised scheme is implemented to increase the quality of the classifier. When compared with established methods, GRGNN achieved state-of-the-art performance on the DREAM5 GRN inference benchmarks. GRGNN is publicly available at https://github.com/juexinwang/GRGNN.
format Online
Article
Text
id pubmed-7677691
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-76776912020-12-07 Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks Wang, Juexin Ma, Anjun Ma, Qin Xu, Dong Joshi, Trupti Comput Struct Biotechnol J Research Article Discovering gene regulatory relationships and reconstructing gene regulatory networks (GRN) based on gene expression data is a classical, long-standing computational challenge in bioinformatics. Computationally inferring a possible regulatory relationship between two genes can be formulated as a link prediction problem between two nodes in a graph. Graph neural network (GNN) provides an opportunity to construct GRN by integrating topological neighbor propagation through the whole gene network. We propose an end-to-end gene regulatory graph neural network (GRGNN) approach to reconstruct GRNs from scratch utilizing the gene expression data, in both a supervised and a semi-supervised framework. To get better inductive generalization capability, GRN inference is formulated as a graph classification problem, to distinguish whether a subgraph centered at two nodes contains the link between the two nodes. A linked pair between a transcription factor (TF) and a target gene, and their neighbors are labeled as a positive subgraph, while an unlinked TF and target gene pair and their neighbors are labeled as a negative subgraph. A GNN model is constructed with node features from both explicit gene expression and graph embedding. We demonstrate a noisy starting graph structure built from partial information, such as Pearson’s correlation coefficient and mutual information can help guide the GRN inference through an appropriate ensemble technique. Furthermore, a semi-supervised scheme is implemented to increase the quality of the classifier. When compared with established methods, GRGNN achieved state-of-the-art performance on the DREAM5 GRN inference benchmarks. GRGNN is publicly available at https://github.com/juexinwang/GRGNN. Research Network of Computational and Structural Biotechnology 2020-11-05 /pmc/articles/PMC7677691/ /pubmed/33294129 http://dx.doi.org/10.1016/j.csbj.2020.10.022 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Wang, Juexin
Ma, Anjun
Ma, Qin
Xu, Dong
Joshi, Trupti
Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks
title Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks
title_full Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks
title_fullStr Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks
title_full_unstemmed Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks
title_short Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks
title_sort inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7677691/
https://www.ncbi.nlm.nih.gov/pubmed/33294129
http://dx.doi.org/10.1016/j.csbj.2020.10.022
work_keys_str_mv AT wangjuexin inductiveinferenceofgeneregulatorynetworkusingsupervisedandsemisupervisedgraphneuralnetworks
AT maanjun inductiveinferenceofgeneregulatorynetworkusingsupervisedandsemisupervisedgraphneuralnetworks
AT maqin inductiveinferenceofgeneregulatorynetworkusingsupervisedandsemisupervisedgraphneuralnetworks
AT xudong inductiveinferenceofgeneregulatorynetworkusingsupervisedandsemisupervisedgraphneuralnetworks
AT joshitrupti inductiveinferenceofgeneregulatorynetworkusingsupervisedandsemisupervisedgraphneuralnetworks