Cargando…
A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression
As increasing experimental studies have shown that microRNAs (miRNAs) are closely related to multiple biological processes and the prevention, diagnosis and treatment of human diseases, a growing number of researchers are focusing on the identification of associations between miRNAs and diseases. Id...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7015868/ https://www.ncbi.nlm.nih.gov/pubmed/32117922 http://dx.doi.org/10.3389/fbioe.2020.00040 |
_version_ | 1783496868704026624 |
---|---|
author | Peng, Li-Hong Zhou, Li-Qian Chen, Xing Piao, Xue |
author_facet | Peng, Li-Hong Zhou, Li-Qian Chen, Xing Piao, Xue |
author_sort | Peng, Li-Hong |
collection | PubMed |
description | As increasing experimental studies have shown that microRNAs (miRNAs) are closely related to multiple biological processes and the prevention, diagnosis and treatment of human diseases, a growing number of researchers are focusing on the identification of associations between miRNAs and diseases. Identifying such associations purely via experiments is costly and demanding, which prompts researchers to develop computational methods to complement the experiments. In this paper, a novel prediction model named Ensemble of Kernel Ridge Regression based MiRNA-Disease Association prediction (EKRRMDA) was developed. EKRRMDA obtained features of miRNAs and diseases by integrating the disease semantic similarity, the miRNA functional similarity and the Gaussian interaction profile kernel similarity for diseases and miRNAs. Under the computational framework that utilized ensemble learning and feature dimensionality reduction, multiple base classifiers that combined two Kernel Ridge Regression classifiers from the miRNA side and disease side, respectively, were obtained based on random selection of features. Then average strategy for these base classifiers was adopted to obtain final association scores of miRNA-disease pairs. In the global and local leave-one-out cross validation, EKRRMDA attained the AUCs of 0.9314 and 0.8618, respectively. Moreover, the model’s average AUC with standard deviation in 5-fold cross validation was 0.9275 ± 0.0008. In addition, we implemented three different types of case studies on predicting miRNAs associated with five important diseases. As a result, there were 90% (Esophageal Neoplasms), 86% (Kidney Neoplasms), 86% (Lymphoma), 98% (Lung Neoplasms), and 96% (Breast Neoplasms) of the top 50 predicted miRNAs verified to have associations with these diseases. |
format | Online Article Text |
id | pubmed-7015868 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-70158682020-02-28 A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression Peng, Li-Hong Zhou, Li-Qian Chen, Xing Piao, Xue Front Bioeng Biotechnol Bioengineering and Biotechnology As increasing experimental studies have shown that microRNAs (miRNAs) are closely related to multiple biological processes and the prevention, diagnosis and treatment of human diseases, a growing number of researchers are focusing on the identification of associations between miRNAs and diseases. Identifying such associations purely via experiments is costly and demanding, which prompts researchers to develop computational methods to complement the experiments. In this paper, a novel prediction model named Ensemble of Kernel Ridge Regression based MiRNA-Disease Association prediction (EKRRMDA) was developed. EKRRMDA obtained features of miRNAs and diseases by integrating the disease semantic similarity, the miRNA functional similarity and the Gaussian interaction profile kernel similarity for diseases and miRNAs. Under the computational framework that utilized ensemble learning and feature dimensionality reduction, multiple base classifiers that combined two Kernel Ridge Regression classifiers from the miRNA side and disease side, respectively, were obtained based on random selection of features. Then average strategy for these base classifiers was adopted to obtain final association scores of miRNA-disease pairs. In the global and local leave-one-out cross validation, EKRRMDA attained the AUCs of 0.9314 and 0.8618, respectively. Moreover, the model’s average AUC with standard deviation in 5-fold cross validation was 0.9275 ± 0.0008. In addition, we implemented three different types of case studies on predicting miRNAs associated with five important diseases. As a result, there were 90% (Esophageal Neoplasms), 86% (Kidney Neoplasms), 86% (Lymphoma), 98% (Lung Neoplasms), and 96% (Breast Neoplasms) of the top 50 predicted miRNAs verified to have associations with these diseases. Frontiers Media S.A. 2020-02-06 /pmc/articles/PMC7015868/ /pubmed/32117922 http://dx.doi.org/10.3389/fbioe.2020.00040 Text en Copyright © 2020 Peng, Zhou, Chen and Piao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioengineering and Biotechnology Peng, Li-Hong Zhou, Li-Qian Chen, Xing Piao, Xue A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression |
title | A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression |
title_full | A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression |
title_fullStr | A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression |
title_full_unstemmed | A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression |
title_short | A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression |
title_sort | computational study of potential mirna-disease association inference based on ensemble learning and kernel ridge regression |
topic | Bioengineering and Biotechnology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7015868/ https://www.ncbi.nlm.nih.gov/pubmed/32117922 http://dx.doi.org/10.3389/fbioe.2020.00040 |
work_keys_str_mv | AT penglihong acomputationalstudyofpotentialmirnadiseaseassociationinferencebasedonensemblelearningandkernelridgeregression AT zhouliqian acomputationalstudyofpotentialmirnadiseaseassociationinferencebasedonensemblelearningandkernelridgeregression AT chenxing acomputationalstudyofpotentialmirnadiseaseassociationinferencebasedonensemblelearningandkernelridgeregression AT piaoxue acomputationalstudyofpotentialmirnadiseaseassociationinferencebasedonensemblelearningandkernelridgeregression AT penglihong computationalstudyofpotentialmirnadiseaseassociationinferencebasedonensemblelearningandkernelridgeregression AT zhouliqian computationalstudyofpotentialmirnadiseaseassociationinferencebasedonensemblelearningandkernelridgeregression AT chenxing computationalstudyofpotentialmirnadiseaseassociationinferencebasedonensemblelearningandkernelridgeregression AT piaoxue computationalstudyofpotentialmirnadiseaseassociationinferencebasedonensemblelearningandkernelridgeregression |