Cargando…
Identifying Candidate Gene–Disease Associations via Graph Neural Networks
Real-world objects are usually defined in terms of their own relationships or connections. A graph (or network) naturally expresses this model though nodes and edges. In biology, depending on what the nodes and edges represent, we may classify several types of networks, gene–disease associations (GD...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10296901/ https://www.ncbi.nlm.nih.gov/pubmed/37372253 http://dx.doi.org/10.3390/e25060909 |
_version_ | 1785063757317144576 |
---|---|
author | Cinaglia, Pietro Cannataro, Mario |
author_facet | Cinaglia, Pietro Cannataro, Mario |
author_sort | Cinaglia, Pietro |
collection | PubMed |
description | Real-world objects are usually defined in terms of their own relationships or connections. A graph (or network) naturally expresses this model though nodes and edges. In biology, depending on what the nodes and edges represent, we may classify several types of networks, gene–disease associations (GDAs) included. In this paper, we presented a solution based on a graph neural network (GNN) for the identification of candidate GDAs. We trained our model with an initial set of well-known and curated inter- and intra-relationships between genes and diseases. It was based on graph convolutions, making use of multiple convolutional layers and a point-wise non-linearity function following each layer. The embeddings were computed for the input network built on a set of GDAs to map each node into a vector of real numbers in a multidimensional space. Results showed an AUC of 95% for training, validation, and testing, that in the real case translated into a positive response for 93% of the Top-15 (highest dot product) candidate GDAs identified by our solution. The experimentation was conducted on the DisGeNET dataset, while the DiseaseGene Association Miner (DG-AssocMiner) dataset by Stanford’s BioSNAP was also processed for performance evaluation only. |
format | Online Article Text |
id | pubmed-10296901 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102969012023-06-28 Identifying Candidate Gene–Disease Associations via Graph Neural Networks Cinaglia, Pietro Cannataro, Mario Entropy (Basel) Article Real-world objects are usually defined in terms of their own relationships or connections. A graph (or network) naturally expresses this model though nodes and edges. In biology, depending on what the nodes and edges represent, we may classify several types of networks, gene–disease associations (GDAs) included. In this paper, we presented a solution based on a graph neural network (GNN) for the identification of candidate GDAs. We trained our model with an initial set of well-known and curated inter- and intra-relationships between genes and diseases. It was based on graph convolutions, making use of multiple convolutional layers and a point-wise non-linearity function following each layer. The embeddings were computed for the input network built on a set of GDAs to map each node into a vector of real numbers in a multidimensional space. Results showed an AUC of 95% for training, validation, and testing, that in the real case translated into a positive response for 93% of the Top-15 (highest dot product) candidate GDAs identified by our solution. The experimentation was conducted on the DisGeNET dataset, while the DiseaseGene Association Miner (DG-AssocMiner) dataset by Stanford’s BioSNAP was also processed for performance evaluation only. MDPI 2023-06-07 /pmc/articles/PMC10296901/ /pubmed/37372253 http://dx.doi.org/10.3390/e25060909 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Cinaglia, Pietro Cannataro, Mario Identifying Candidate Gene–Disease Associations via Graph Neural Networks |
title | Identifying Candidate Gene–Disease Associations via Graph Neural Networks |
title_full | Identifying Candidate Gene–Disease Associations via Graph Neural Networks |
title_fullStr | Identifying Candidate Gene–Disease Associations via Graph Neural Networks |
title_full_unstemmed | Identifying Candidate Gene–Disease Associations via Graph Neural Networks |
title_short | Identifying Candidate Gene–Disease Associations via Graph Neural Networks |
title_sort | identifying candidate gene–disease associations via graph neural networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10296901/ https://www.ncbi.nlm.nih.gov/pubmed/37372253 http://dx.doi.org/10.3390/e25060909 |
work_keys_str_mv | AT cinagliapietro identifyingcandidategenediseaseassociationsviagraphneuralnetworks AT cannataromario identifyingcandidategenediseaseassociationsviagraphneuralnetworks |