Cargando…
Predicting miRNA–disease associations using improved random walk with restart and integrating multiple similarities
Predicting beneficial and valuable miRNA–disease associations (MDAs) by doing biological laboratory experiments is costly and time-consuming. Proposing a forceful and meaningful computational method for predicting MDAs is essential and captivated many computer scientists in recent years. In this pap...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8548500/ https://www.ncbi.nlm.nih.gov/pubmed/34702958 http://dx.doi.org/10.1038/s41598-021-00677-w |
_version_ | 1784590583617028096 |
---|---|
author | Nguyen, Van Tinh Le, Thi Tu Kien Than, Khoat Tran, Dang Hung |
author_facet | Nguyen, Van Tinh Le, Thi Tu Kien Than, Khoat Tran, Dang Hung |
author_sort | Nguyen, Van Tinh |
collection | PubMed |
description | Predicting beneficial and valuable miRNA–disease associations (MDAs) by doing biological laboratory experiments is costly and time-consuming. Proposing a forceful and meaningful computational method for predicting MDAs is essential and captivated many computer scientists in recent years. In this paper, we proposed a new computational method to predict miRNA–disease associations using improved random walk with restart and integrating multiple similarities (RWRMMDA). We used a WKNKN algorithm as a pre-processing step to solve the problem of sparsity and incompletion of data to reduce the negative impact of a large number of missing associations. Two heterogeneous networks in disease and miRNA spaces were built by integrating multiple similarity networks, respectively, and different walk probabilities could be designated to each linked neighbor node of the disease or miRNA node in line with its degree in respective networks. Finally, an improve extended random walk with restart algorithm based on miRNA similarity-based and disease similarity-based heterogeneous networks was used to calculate miRNA–disease association prediction probabilities. The experiments showed that our proposed method achieved a momentous performance with Global LOOCV AUC (Area Under Roc Curve) and AUPR (Area Under Precision-Recall Curve) values of 0.9882 and 0.9066, respectively. And the best AUC and AUPR values under fivefold cross-validation of 0.9855 and 0.8642 which are proven by statistical tests, respectively. In comparison with other previous related methods, it outperformed than NTSHMDA, PMFMDA, IMCMDA and MCLPMDA methods in both AUC and AUPR values. In case studies of Breast Neoplasms, Carcinoma Hepatocellular and Stomach Neoplasms diseases, it inferred 1, 12 and 7 new associations out of top 40 predicted associated miRNAs for each disease, respectively. All of these new inferred associations have been confirmed in different databases or literatures. |
format | Online Article Text |
id | pubmed-8548500 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-85485002021-10-28 Predicting miRNA–disease associations using improved random walk with restart and integrating multiple similarities Nguyen, Van Tinh Le, Thi Tu Kien Than, Khoat Tran, Dang Hung Sci Rep Article Predicting beneficial and valuable miRNA–disease associations (MDAs) by doing biological laboratory experiments is costly and time-consuming. Proposing a forceful and meaningful computational method for predicting MDAs is essential and captivated many computer scientists in recent years. In this paper, we proposed a new computational method to predict miRNA–disease associations using improved random walk with restart and integrating multiple similarities (RWRMMDA). We used a WKNKN algorithm as a pre-processing step to solve the problem of sparsity and incompletion of data to reduce the negative impact of a large number of missing associations. Two heterogeneous networks in disease and miRNA spaces were built by integrating multiple similarity networks, respectively, and different walk probabilities could be designated to each linked neighbor node of the disease or miRNA node in line with its degree in respective networks. Finally, an improve extended random walk with restart algorithm based on miRNA similarity-based and disease similarity-based heterogeneous networks was used to calculate miRNA–disease association prediction probabilities. The experiments showed that our proposed method achieved a momentous performance with Global LOOCV AUC (Area Under Roc Curve) and AUPR (Area Under Precision-Recall Curve) values of 0.9882 and 0.9066, respectively. And the best AUC and AUPR values under fivefold cross-validation of 0.9855 and 0.8642 which are proven by statistical tests, respectively. In comparison with other previous related methods, it outperformed than NTSHMDA, PMFMDA, IMCMDA and MCLPMDA methods in both AUC and AUPR values. In case studies of Breast Neoplasms, Carcinoma Hepatocellular and Stomach Neoplasms diseases, it inferred 1, 12 and 7 new associations out of top 40 predicted associated miRNAs for each disease, respectively. All of these new inferred associations have been confirmed in different databases or literatures. Nature Publishing Group UK 2021-10-26 /pmc/articles/PMC8548500/ /pubmed/34702958 http://dx.doi.org/10.1038/s41598-021-00677-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Nguyen, Van Tinh Le, Thi Tu Kien Than, Khoat Tran, Dang Hung Predicting miRNA–disease associations using improved random walk with restart and integrating multiple similarities |
title | Predicting miRNA–disease associations using improved random walk with restart and integrating multiple similarities |
title_full | Predicting miRNA–disease associations using improved random walk with restart and integrating multiple similarities |
title_fullStr | Predicting miRNA–disease associations using improved random walk with restart and integrating multiple similarities |
title_full_unstemmed | Predicting miRNA–disease associations using improved random walk with restart and integrating multiple similarities |
title_short | Predicting miRNA–disease associations using improved random walk with restart and integrating multiple similarities |
title_sort | predicting mirna–disease associations using improved random walk with restart and integrating multiple similarities |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8548500/ https://www.ncbi.nlm.nih.gov/pubmed/34702958 http://dx.doi.org/10.1038/s41598-021-00677-w |
work_keys_str_mv | AT nguyenvantinh predictingmirnadiseaseassociationsusingimprovedrandomwalkwithrestartandintegratingmultiplesimilarities AT lethitukien predictingmirnadiseaseassociationsusingimprovedrandomwalkwithrestartandintegratingmultiplesimilarities AT thankhoat predictingmirnadiseaseassociationsusingimprovedrandomwalkwithrestartandintegratingmultiplesimilarities AT trandanghung predictingmirnadiseaseassociationsusingimprovedrandomwalkwithrestartandintegratingmultiplesimilarities |