A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression

As increasing experimental studies have shown that microRNAs (miRNAs) are closely related to multiple biological processes and the prevention, diagnosis and treatment of human diseases, a growing number of researchers are focusing on the identification of associations between miRNAs and diseases. Id...

Bibliographic Details
Main authors: Peng, Li-Hong, Zhou, Li-Qian, Chen, Xing, Piao, Xue
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7015868/
https://www.ncbi.nlm.nih.gov/pubmed/32117922
http://dx.doi.org/10.3389/fbioe.2020.00040
_version_ 1783496868704026624
author Peng, Li-Hong
Zhou, Li-Qian
Chen, Xing
Piao, Xue
collection PubMed
description As a growing number of experimental studies have shown that microRNAs (miRNAs) are closely related to multiple biological processes and to the prevention, diagnosis and treatment of human diseases, more researchers are focusing on identifying associations between miRNAs and diseases. Identifying such associations purely through experiments is costly and demanding, which has prompted researchers to develop computational methods that complement the experiments. In this paper, a novel prediction model named Ensemble of Kernel Ridge Regression based MiRNA-Disease Association prediction (EKRRMDA) was developed. EKRRMDA obtained features of miRNAs and diseases by integrating the disease semantic similarity, the miRNA functional similarity and the Gaussian interaction profile kernel similarity for diseases and miRNAs. Within a computational framework that used ensemble learning and feature dimensionality reduction, multiple base classifiers, each combining two Kernel Ridge Regression classifiers (one from the miRNA side and one from the disease side), were obtained based on random selection of features. An averaging strategy over these base classifiers was then adopted to obtain the final association scores of miRNA-disease pairs. In global and local leave-one-out cross validation, EKRRMDA attained AUCs of 0.9314 and 0.8618, respectively. Moreover, the model’s average AUC with standard deviation in 5-fold cross validation was 0.9275 ± 0.0008. In addition, we implemented three different types of case studies on predicting miRNAs associated with five important diseases. As a result, 90% (Esophageal Neoplasms), 86% (Kidney Neoplasms), 86% (Lymphoma), 98% (Lung Neoplasms), and 96% (Breast Neoplasms) of the top 50 predicted miRNAs were verified to have associations with these diseases.
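The pipeline summarized in the description (Gaussian kernel similarity, two kernel ridge regressions per base learner, random feature selection, score averaging) can be illustrated with a small sketch. This is not the authors' EKRRMDA implementation: the function names, the `keep` fraction, the regularization value `lam`, and the use of only the association matrix as input (without disease semantic or miRNA functional similarity, and without the paper's dimensionality reduction step) are all assumptions made for illustration.

```python
import numpy as np

def gaussian_kernel(features):
    """Gaussian kernel between the rows of `features`.

    The bandwidth is normalised by the mean squared row norm, as is common
    for Gaussian interaction profile kernels (an assumption here).
    """
    sq_norms = (features ** 2).sum(axis=1)
    gamma = 1.0 / max(sq_norms.mean(), 1e-12)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * features @ features.T
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))

def krr_scores(kernel, targets, lam=1.0):
    """Kernel ridge regression fitted values: K (K + lam*I)^-1 Y."""
    n = kernel.shape[0]
    return kernel @ np.linalg.solve(kernel + lam * np.eye(n), targets)

def ensemble_scores(assoc, n_learners=10, keep=0.5, lam=1.0, seed=0):
    """Average the scores of base learners built on random feature subsets.

    `assoc` is a binary miRNA x disease association matrix. Each base learner
    averages one miRNA-side and one disease-side kernel ridge regression.
    """
    rng = np.random.default_rng(seed)
    n_mirna, n_disease = assoc.shape
    sim_m = gaussian_kernel(assoc)    # miRNA features: similarity to every miRNA
    sim_d = gaussian_kernel(assoc.T)  # disease features: similarity to every disease
    total = np.zeros(assoc.shape)
    for _ in range(n_learners):
        # Hypothetical feature selection: keep a random subset of feature columns.
        cols_m = rng.choice(n_mirna, size=max(1, int(keep * n_mirna)), replace=False)
        cols_d = rng.choice(n_disease, size=max(1, int(keep * n_disease)), replace=False)
        k_m = gaussian_kernel(sim_m[:, cols_m])
        k_d = gaussian_kernel(sim_d[:, cols_d])
        from_mirna = krr_scores(k_m, assoc, lam)         # miRNA-side prediction
        from_disease = krr_scores(k_d, assoc.T, lam).T   # disease-side prediction
        total += 0.5 * (from_mirna + from_disease)
    return total / n_learners
```

Calling `ensemble_scores(A)` on a binary association matrix `A` returns a matrix of the same shape whose entries can be ranked per disease column to list candidate miRNAs; the values are only as meaningful as the toy features used here.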
format Online
Article
Text
id pubmed-7015868
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-7015868 2020-02-28 A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression Peng, Li-Hong Zhou, Li-Qian Chen, Xing Piao, Xue Front Bioeng Biotechnol Bioengineering and Biotechnology Frontiers Media S.A. 2020-02-06 /pmc/articles/PMC7015868/ /pubmed/32117922 http://dx.doi.org/10.3389/fbioe.2020.00040 Text en Copyright © 2020 Peng, Zhou, Chen and Piao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression
topic Bioengineering and Biotechnology