Cargando…
An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
BACKGROUND: Drug-disease associations (DDAs) can provide important information for exploring the potential efficacy of drugs. However, up to now, there are still few DDAs verified by experiments. Previous evidence indicates that the combination of information would be conducive to the discovery of n...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8726520/ https://www.ncbi.nlm.nih.gov/pubmed/34983364 http://dx.doi.org/10.1186/s12859-021-04553-2 |
_version_ | 1784626343561920512 |
---|---|
author | Jiang, Hanjing Huang, Yabing |
author_facet | Jiang, Hanjing Huang, Yabing |
author_sort | Jiang, Hanjing |
collection | PubMed |
description | BACKGROUND: Drug-disease associations (DDAs) can provide important information for exploring the potential efficacy of drugs. However, up to now, there are still few DDAs verified by experiments. Previous evidence indicates that the combination of information would be conducive to the discovery of new DDAs. How to integrate different biological data sources and identify the most effective drugs for a certain disease based on drug-disease coupled mechanisms is still a challenging problem. RESULTS: In this paper, we proposed a novel computation model for DDA predictions based on graph representation learning over multi-biomolecular network (GRLMN). More specifically, we firstly constructed a large-scale molecular association network (MAN) by integrating the associations among drugs, diseases, proteins, miRNAs, and lncRNAs. Then, a graph embedding model was used to learn vector representations for all drugs and diseases in MAN. Finally, the combined features were fed to a random forest (RF) model to predict new DDAs. The proposed model was evaluated on the SCMFDD-S data set using five-fold cross-validation. Experiment results showed that GRLMN model was very accurate with the area under the ROC curve (AUC) of 87.9%, which outperformed all previous works in terms of both accuracy and AUC in benchmark dataset. To further verify the high performance of GRLMN, we carried out two case studies for two common diseases. As a result, in the ranking of drugs that were predicted to be related to certain diseases (such as kidney disease and fever), 15 of the top 20 drugs have been experimentally confirmed. CONCLUSIONS: The experimental results show that our model has good performance in the prediction of DDA. GRLMN is an effective prioritization tool for screening the reliable DDAs for follow-up studies concerning their participation in drug reposition. |
format | Online Article Text |
id | pubmed-8726520 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-87265202022-01-05 An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network Jiang, Hanjing Huang, Yabing BMC Bioinformatics Research BACKGROUND: Drug-disease associations (DDAs) can provide important information for exploring the potential efficacy of drugs. However, up to now, there are still few DDAs verified by experiments. Previous evidence indicates that the combination of information would be conducive to the discovery of new DDAs. How to integrate different biological data sources and identify the most effective drugs for a certain disease based on drug-disease coupled mechanisms is still a challenging problem. RESULTS: In this paper, we proposed a novel computation model for DDA predictions based on graph representation learning over multi-biomolecular network (GRLMN). More specifically, we firstly constructed a large-scale molecular association network (MAN) by integrating the associations among drugs, diseases, proteins, miRNAs, and lncRNAs. Then, a graph embedding model was used to learn vector representations for all drugs and diseases in MAN. Finally, the combined features were fed to a random forest (RF) model to predict new DDAs. The proposed model was evaluated on the SCMFDD-S data set using five-fold cross-validation. Experiment results showed that GRLMN model was very accurate with the area under the ROC curve (AUC) of 87.9%, which outperformed all previous works in terms of both accuracy and AUC in benchmark dataset. To further verify the high performance of GRLMN, we carried out two case studies for two common diseases. As a result, in the ranking of drugs that were predicted to be related to certain diseases (such as kidney disease and fever), 15 of the top 20 drugs have been experimentally confirmed. CONCLUSIONS: The experimental results show that our model has good performance in the prediction of DDA. GRLMN is an effective prioritization tool for screening the reliable DDAs for follow-up studies concerning their participation in drug reposition. BioMed Central 2022-01-04 /pmc/articles/PMC8726520/ /pubmed/34983364 http://dx.doi.org/10.1186/s12859-021-04553-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Jiang, Hanjing Huang, Yabing An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network |
title | An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network |
title_full | An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network |
title_fullStr | An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network |
title_full_unstemmed | An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network |
title_short | An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network |
title_sort | effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8726520/ https://www.ncbi.nlm.nih.gov/pubmed/34983364 http://dx.doi.org/10.1186/s12859-021-04553-2 |
work_keys_str_mv | AT jianghanjing aneffectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork AT huangyabing aneffectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork AT jianghanjing effectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork AT huangyabing effectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork |