Cargando…

An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network

BACKGROUND: Drug-disease associations (DDAs) can provide important information for exploring the potential efficacy of drugs. However, up to now, there are still few DDAs verified by experiments. Previous evidence indicates that the combination of information would be conducive to the discovery of n...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Hanjing, Huang, Yabing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8726520/
https://www.ncbi.nlm.nih.gov/pubmed/34983364
http://dx.doi.org/10.1186/s12859-021-04553-2
_version_ 1784626343561920512
author Jiang, Hanjing
Huang, Yabing
author_facet Jiang, Hanjing
Huang, Yabing
author_sort Jiang, Hanjing
collection PubMed
description BACKGROUND: Drug-disease associations (DDAs) can provide important information for exploring the potential efficacy of drugs. However, up to now, there are still few DDAs verified by experiments. Previous evidence indicates that the combination of information would be conducive to the discovery of new DDAs. How to integrate different biological data sources and identify the most effective drugs for a certain disease based on drug-disease coupled mechanisms is still a challenging problem. RESULTS: In this paper, we proposed a novel computation model for DDA predictions based on graph representation learning over multi-biomolecular network (GRLMN). More specifically, we firstly constructed a large-scale molecular association network (MAN) by integrating the associations among drugs, diseases, proteins, miRNAs, and lncRNAs. Then, a graph embedding model was used to learn vector representations for all drugs and diseases in MAN. Finally, the combined features were fed to a random forest (RF) model to predict new DDAs. The proposed model was evaluated on the SCMFDD-S data set using five-fold cross-validation. Experiment results showed that GRLMN model was very accurate with the area under the ROC curve (AUC) of 87.9%, which outperformed all previous works in terms of both accuracy and AUC in benchmark dataset. To further verify the high performance of GRLMN, we carried out two case studies for two common diseases. As a result, in the ranking of drugs that were predicted to be related to certain diseases (such as kidney disease and fever), 15 of the top 20 drugs have been experimentally confirmed. CONCLUSIONS: The experimental results show that our model has good performance in the prediction of DDA. GRLMN is an effective prioritization tool for screening the reliable DDAs for follow-up studies concerning their participation in drug reposition.
format Online
Article
Text
id pubmed-8726520
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-87265202022-01-05 An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network Jiang, Hanjing Huang, Yabing BMC Bioinformatics Research BACKGROUND: Drug-disease associations (DDAs) can provide important information for exploring the potential efficacy of drugs. However, up to now, there are still few DDAs verified by experiments. Previous evidence indicates that the combination of information would be conducive to the discovery of new DDAs. How to integrate different biological data sources and identify the most effective drugs for a certain disease based on drug-disease coupled mechanisms is still a challenging problem. RESULTS: In this paper, we proposed a novel computation model for DDA predictions based on graph representation learning over multi-biomolecular network (GRLMN). More specifically, we firstly constructed a large-scale molecular association network (MAN) by integrating the associations among drugs, diseases, proteins, miRNAs, and lncRNAs. Then, a graph embedding model was used to learn vector representations for all drugs and diseases in MAN. Finally, the combined features were fed to a random forest (RF) model to predict new DDAs. The proposed model was evaluated on the SCMFDD-S data set using five-fold cross-validation. Experiment results showed that GRLMN model was very accurate with the area under the ROC curve (AUC) of 87.9%, which outperformed all previous works in terms of both accuracy and AUC in benchmark dataset. To further verify the high performance of GRLMN, we carried out two case studies for two common diseases. As a result, in the ranking of drugs that were predicted to be related to certain diseases (such as kidney disease and fever), 15 of the top 20 drugs have been experimentally confirmed. CONCLUSIONS: The experimental results show that our model has good performance in the prediction of DDA. GRLMN is an effective prioritization tool for screening the reliable DDAs for follow-up studies concerning their participation in drug reposition. BioMed Central 2022-01-04 /pmc/articles/PMC8726520/ /pubmed/34983364 http://dx.doi.org/10.1186/s12859-021-04553-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Jiang, Hanjing
Huang, Yabing
An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_full An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_fullStr An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_full_unstemmed An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_short An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_sort effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8726520/
https://www.ncbi.nlm.nih.gov/pubmed/34983364
http://dx.doi.org/10.1186/s12859-021-04553-2
work_keys_str_mv AT jianghanjing aneffectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork
AT huangyabing aneffectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork
AT jianghanjing effectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork
AT huangyabing effectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork