Cargando…

Identifying Candidate Gene–Disease Associations via Graph Neural Networks

Real-world objects are usually defined in terms of their own relationships or connections. A graph (or network) naturally expresses this model though nodes and edges. In biology, depending on what the nodes and edges represent, we may classify several types of networks, gene–disease associations (GD...

Descripción completa

Detalles Bibliográficos
Autores principales: Cinaglia, Pietro, Cannataro, Mario
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10296901/
https://www.ncbi.nlm.nih.gov/pubmed/37372253
http://dx.doi.org/10.3390/e25060909
_version_ 1785063757317144576
author Cinaglia, Pietro
Cannataro, Mario
author_facet Cinaglia, Pietro
Cannataro, Mario
author_sort Cinaglia, Pietro
collection PubMed
description Real-world objects are usually defined in terms of their own relationships or connections. A graph (or network) naturally expresses this model though nodes and edges. In biology, depending on what the nodes and edges represent, we may classify several types of networks, gene–disease associations (GDAs) included. In this paper, we presented a solution based on a graph neural network (GNN) for the identification of candidate GDAs. We trained our model with an initial set of well-known and curated inter- and intra-relationships between genes and diseases. It was based on graph convolutions, making use of multiple convolutional layers and a point-wise non-linearity function following each layer. The embeddings were computed for the input network built on a set of GDAs to map each node into a vector of real numbers in a multidimensional space. Results showed an AUC of 95% for training, validation, and testing, that in the real case translated into a positive response for 93% of the Top-15 (highest dot product) candidate GDAs identified by our solution. The experimentation was conducted on the DisGeNET dataset, while the DiseaseGene Association Miner (DG-AssocMiner) dataset by Stanford’s BioSNAP was also processed for performance evaluation only.
format Online
Article
Text
id pubmed-10296901
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102969012023-06-28 Identifying Candidate Gene–Disease Associations via Graph Neural Networks Cinaglia, Pietro Cannataro, Mario Entropy (Basel) Article Real-world objects are usually defined in terms of their own relationships or connections. A graph (or network) naturally expresses this model though nodes and edges. In biology, depending on what the nodes and edges represent, we may classify several types of networks, gene–disease associations (GDAs) included. In this paper, we presented a solution based on a graph neural network (GNN) for the identification of candidate GDAs. We trained our model with an initial set of well-known and curated inter- and intra-relationships between genes and diseases. It was based on graph convolutions, making use of multiple convolutional layers and a point-wise non-linearity function following each layer. The embeddings were computed for the input network built on a set of GDAs to map each node into a vector of real numbers in a multidimensional space. Results showed an AUC of 95% for training, validation, and testing, that in the real case translated into a positive response for 93% of the Top-15 (highest dot product) candidate GDAs identified by our solution. The experimentation was conducted on the DisGeNET dataset, while the DiseaseGene Association Miner (DG-AssocMiner) dataset by Stanford’s BioSNAP was also processed for performance evaluation only. MDPI 2023-06-07 /pmc/articles/PMC10296901/ /pubmed/37372253 http://dx.doi.org/10.3390/e25060909 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Cinaglia, Pietro
Cannataro, Mario
Identifying Candidate Gene–Disease Associations via Graph Neural Networks
title Identifying Candidate Gene–Disease Associations via Graph Neural Networks
title_full Identifying Candidate Gene–Disease Associations via Graph Neural Networks
title_fullStr Identifying Candidate Gene–Disease Associations via Graph Neural Networks
title_full_unstemmed Identifying Candidate Gene–Disease Associations via Graph Neural Networks
title_short Identifying Candidate Gene–Disease Associations via Graph Neural Networks
title_sort identifying candidate gene–disease associations via graph neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10296901/
https://www.ncbi.nlm.nih.gov/pubmed/37372253
http://dx.doi.org/10.3390/e25060909
work_keys_str_mv AT cinagliapietro identifyingcandidategenediseaseassociationsviagraphneuralnetworks
AT cannataromario identifyingcandidategenediseaseassociationsviagraphneuralnetworks