Cargando…
Novel Human miRNA-Disease Association Inference Based on Random Forest
Since the first microRNA (miRNA) was discovered, a lot of studies have confirmed the associations between miRNAs and human complex diseases. Besides, obtaining and taking advantage of association information between miRNAs and diseases play an increasingly important role in improving the treatment l...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society of Gene & Cell Therapy
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234518/ https://www.ncbi.nlm.nih.gov/pubmed/30439645 http://dx.doi.org/10.1016/j.omtn.2018.10.005 |
_version_ | 1783370708028489728 |
---|---|
author | Chen, Xing Wang, Chun-Chun Yin, Jun You, Zhu-Hong |
author_facet | Chen, Xing Wang, Chun-Chun Yin, Jun You, Zhu-Hong |
author_sort | Chen, Xing |
collection | PubMed |
description | Since the first microRNA (miRNA) was discovered, a lot of studies have confirmed the associations between miRNAs and human complex diseases. Besides, obtaining and taking advantage of association information between miRNAs and diseases play an increasingly important role in improving the treatment level for complex diseases. However, due to the high cost of traditional experimental methods, many researchers have proposed different computational methods to predict potential associations between miRNAs and diseases. In this work, we developed a computational model of Random Forest for miRNA-disease association (RFMDA) prediction based on machine learning. The training sample set for RFMDA was constructed according to the human microRNA disease database (HMDD) version (v.)2.0, and the feature vectors to represent miRNA-disease samples were defined by integrating miRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. The Random Forest algorithm was first employed to infer miRNA-disease associations. In addition, a filter-based method was implemented to select robust features from the miRNA-disease feature set, which could efficiently distinguish related miRNA-disease pairs from unrelated miRNA-disease pairs. RFMDA achieved areas under the curve (AUCs) of 0.8891, 0.8323, and 0.8818 ± 0.0014 under global leave-one-out cross-validation, local leave-one-out cross-validation, and 5-fold cross-validation, respectively, which were higher than many previous computational models. To further evaluate the accuracy of RFMDA, we carried out three types of case studies for four human complex diseases. As a result, 43 (esophageal neoplasms), 46 (lymphoma), 47 (lung neoplasms), and 48 (breast neoplasms) of the top 50 predicted disease-related miRNAs were verified by experiments in different kinds of case studies. The results of cross-validation and case studies indicated that RFMDA is a reliable model for predicting miRNA-disease associations. |
format | Online Article Text |
id | pubmed-6234518 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | American Society of Gene & Cell Therapy |
record_format | MEDLINE/PubMed |
spelling | pubmed-62345182018-11-19 Novel Human miRNA-Disease Association Inference Based on Random Forest Chen, Xing Wang, Chun-Chun Yin, Jun You, Zhu-Hong Mol Ther Nucleic Acids Article Since the first microRNA (miRNA) was discovered, a lot of studies have confirmed the associations between miRNAs and human complex diseases. Besides, obtaining and taking advantage of association information between miRNAs and diseases play an increasingly important role in improving the treatment level for complex diseases. However, due to the high cost of traditional experimental methods, many researchers have proposed different computational methods to predict potential associations between miRNAs and diseases. In this work, we developed a computational model of Random Forest for miRNA-disease association (RFMDA) prediction based on machine learning. The training sample set for RFMDA was constructed according to the human microRNA disease database (HMDD) version (v.)2.0, and the feature vectors to represent miRNA-disease samples were defined by integrating miRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. The Random Forest algorithm was first employed to infer miRNA-disease associations. In addition, a filter-based method was implemented to select robust features from the miRNA-disease feature set, which could efficiently distinguish related miRNA-disease pairs from unrelated miRNA-disease pairs. RFMDA achieved areas under the curve (AUCs) of 0.8891, 0.8323, and 0.8818 ± 0.0014 under global leave-one-out cross-validation, local leave-one-out cross-validation, and 5-fold cross-validation, respectively, which were higher than many previous computational models. To further evaluate the accuracy of RFMDA, we carried out three types of case studies for four human complex diseases. As a result, 43 (esophageal neoplasms), 46 (lymphoma), 47 (lung neoplasms), and 48 (breast neoplasms) of the top 50 predicted disease-related miRNAs were verified by experiments in different kinds of case studies. The results of cross-validation and case studies indicated that RFMDA is a reliable model for predicting miRNA-disease associations. American Society of Gene & Cell Therapy 2018-10-11 /pmc/articles/PMC6234518/ /pubmed/30439645 http://dx.doi.org/10.1016/j.omtn.2018.10.005 Text en © 2018 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Article Chen, Xing Wang, Chun-Chun Yin, Jun You, Zhu-Hong Novel Human miRNA-Disease Association Inference Based on Random Forest |
title | Novel Human miRNA-Disease Association Inference Based on Random Forest |
title_full | Novel Human miRNA-Disease Association Inference Based on Random Forest |
title_fullStr | Novel Human miRNA-Disease Association Inference Based on Random Forest |
title_full_unstemmed | Novel Human miRNA-Disease Association Inference Based on Random Forest |
title_short | Novel Human miRNA-Disease Association Inference Based on Random Forest |
title_sort | novel human mirna-disease association inference based on random forest |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234518/ https://www.ncbi.nlm.nih.gov/pubmed/30439645 http://dx.doi.org/10.1016/j.omtn.2018.10.005 |
work_keys_str_mv | AT chenxing novelhumanmirnadiseaseassociationinferencebasedonrandomforest AT wangchunchun novelhumanmirnadiseaseassociationinferencebasedonrandomforest AT yinjun novelhumanmirnadiseaseassociationinferencebasedonrandomforest AT youzhuhong novelhumanmirnadiseaseassociationinferencebasedonrandomforest |