Cargando…

Novel Human miRNA-Disease Association Inference Based on Random Forest

Since the first microRNA (miRNA) was discovered, a lot of studies have confirmed the associations between miRNAs and human complex diseases. Besides, obtaining and taking advantage of association information between miRNAs and diseases play an increasingly important role in improving the treatment l...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xing, Wang, Chun-Chun, Yin, Jun, You, Zhu-Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Gene & Cell Therapy 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234518/
https://www.ncbi.nlm.nih.gov/pubmed/30439645
http://dx.doi.org/10.1016/j.omtn.2018.10.005
_version_ 1783370708028489728
author Chen, Xing
Wang, Chun-Chun
Yin, Jun
You, Zhu-Hong
author_facet Chen, Xing
Wang, Chun-Chun
Yin, Jun
You, Zhu-Hong
author_sort Chen, Xing
collection PubMed
description Since the first microRNA (miRNA) was discovered, a lot of studies have confirmed the associations between miRNAs and human complex diseases. Besides, obtaining and taking advantage of association information between miRNAs and diseases play an increasingly important role in improving the treatment level for complex diseases. However, due to the high cost of traditional experimental methods, many researchers have proposed different computational methods to predict potential associations between miRNAs and diseases. In this work, we developed a computational model of Random Forest for miRNA-disease association (RFMDA) prediction based on machine learning. The training sample set for RFMDA was constructed according to the human microRNA disease database (HMDD) version (v.)2.0, and the feature vectors to represent miRNA-disease samples were defined by integrating miRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. The Random Forest algorithm was first employed to infer miRNA-disease associations. In addition, a filter-based method was implemented to select robust features from the miRNA-disease feature set, which could efficiently distinguish related miRNA-disease pairs from unrelated miRNA-disease pairs. RFMDA achieved areas under the curve (AUCs) of 0.8891, 0.8323, and 0.8818 ± 0.0014 under global leave-one-out cross-validation, local leave-one-out cross-validation, and 5-fold cross-validation, respectively, which were higher than many previous computational models. To further evaluate the accuracy of RFMDA, we carried out three types of case studies for four human complex diseases. As a result, 43 (esophageal neoplasms), 46 (lymphoma), 47 (lung neoplasms), and 48 (breast neoplasms) of the top 50 predicted disease-related miRNAs were verified by experiments in different kinds of case studies. The results of cross-validation and case studies indicated that RFMDA is a reliable model for predicting miRNA-disease associations.
format Online
Article
Text
id pubmed-6234518
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher American Society of Gene & Cell Therapy
record_format MEDLINE/PubMed
spelling pubmed-62345182018-11-19 Novel Human miRNA-Disease Association Inference Based on Random Forest Chen, Xing Wang, Chun-Chun Yin, Jun You, Zhu-Hong Mol Ther Nucleic Acids Article Since the first microRNA (miRNA) was discovered, a lot of studies have confirmed the associations between miRNAs and human complex diseases. Besides, obtaining and taking advantage of association information between miRNAs and diseases play an increasingly important role in improving the treatment level for complex diseases. However, due to the high cost of traditional experimental methods, many researchers have proposed different computational methods to predict potential associations between miRNAs and diseases. In this work, we developed a computational model of Random Forest for miRNA-disease association (RFMDA) prediction based on machine learning. The training sample set for RFMDA was constructed according to the human microRNA disease database (HMDD) version (v.)2.0, and the feature vectors to represent miRNA-disease samples were defined by integrating miRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. The Random Forest algorithm was first employed to infer miRNA-disease associations. In addition, a filter-based method was implemented to select robust features from the miRNA-disease feature set, which could efficiently distinguish related miRNA-disease pairs from unrelated miRNA-disease pairs. RFMDA achieved areas under the curve (AUCs) of 0.8891, 0.8323, and 0.8818 ± 0.0014 under global leave-one-out cross-validation, local leave-one-out cross-validation, and 5-fold cross-validation, respectively, which were higher than many previous computational models. To further evaluate the accuracy of RFMDA, we carried out three types of case studies for four human complex diseases. As a result, 43 (esophageal neoplasms), 46 (lymphoma), 47 (lung neoplasms), and 48 (breast neoplasms) of the top 50 predicted disease-related miRNAs were verified by experiments in different kinds of case studies. The results of cross-validation and case studies indicated that RFMDA is a reliable model for predicting miRNA-disease associations. American Society of Gene & Cell Therapy 2018-10-11 /pmc/articles/PMC6234518/ /pubmed/30439645 http://dx.doi.org/10.1016/j.omtn.2018.10.005 Text en © 2018 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Chen, Xing
Wang, Chun-Chun
Yin, Jun
You, Zhu-Hong
Novel Human miRNA-Disease Association Inference Based on Random Forest
title Novel Human miRNA-Disease Association Inference Based on Random Forest
title_full Novel Human miRNA-Disease Association Inference Based on Random Forest
title_fullStr Novel Human miRNA-Disease Association Inference Based on Random Forest
title_full_unstemmed Novel Human miRNA-Disease Association Inference Based on Random Forest
title_short Novel Human miRNA-Disease Association Inference Based on Random Forest
title_sort novel human mirna-disease association inference based on random forest
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234518/
https://www.ncbi.nlm.nih.gov/pubmed/30439645
http://dx.doi.org/10.1016/j.omtn.2018.10.005
work_keys_str_mv AT chenxing novelhumanmirnadiseaseassociationinferencebasedonrandomforest
AT wangchunchun novelhumanmirnadiseaseassociationinferencebasedonrandomforest
AT yinjun novelhumanmirnadiseaseassociationinferencebasedonrandomforest
AT youzhuhong novelhumanmirnadiseaseassociationinferencebasedonrandomforest