Cargando…

Ensemble of decision tree reveals potential miRNA-disease associations

In recent years, increasing associations between microRNAs (miRNAs) and human diseases have been identified. Based on accumulating biological data, many computational models for potential miRNA-disease associations inference have been developed, which saves time and expenditure on experimental studi...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xing, Zhu, Chi-Chi, Yin, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6675125/
https://www.ncbi.nlm.nih.gov/pubmed/31329575
http://dx.doi.org/10.1371/journal.pcbi.1007209
_version_ 1783440622847262720
author Chen, Xing
Zhu, Chi-Chi
Yin, Jun
author_facet Chen, Xing
Zhu, Chi-Chi
Yin, Jun
author_sort Chen, Xing
collection PubMed
description In recent years, increasing associations between microRNAs (miRNAs) and human diseases have been identified. Based on accumulating biological data, many computational models for potential miRNA-disease associations inference have been developed, which saves time and expenditure on experimental studies, making great contributions to researching molecular mechanism of human diseases and developing new drugs for disease treatment. In this paper, we proposed a novel computational method named Ensemble of Decision Tree based MiRNA-Disease Association prediction (EDTMDA), which innovatively built a computational framework integrating ensemble learning and dimensionality reduction. For each miRNA-disease pair, the feature vector was extracted by calculating the statistical measures, graph theoretical measures, and matrix factorization results for the miRNA and disease, respectively. Then multiple base learnings were built to yield many decision trees (DTs) based on random selection of negative samples and miRNA/disease features. Particularly, Principal Components Analysis was applied to each base learning to reduce feature dimensionality and hence remove the noise or redundancy. Average strategy was adopted for these DTs to get final association scores between miRNAs and diseases. In model performance evaluation, EDTMDA showed AUC of 0.9309 in global leave-one-out cross validation (LOOCV) and AUC of 0.8524 in local LOOCV. Additionally, AUC of 0.9192+/-0.0009 in 5-fold cross validation proved the model’s reliability and stability. Furthermore, three types of case studies for four human diseases were implemented. As a result, 94% (Esophageal Neoplasms), 86% (Kidney Neoplasms), 96% (Breast Neoplasms) and 88% (Carcinoma Hepatocellular) of top 50 predicted miRNAs were confirmed by experimental evidences in literature.
format Online
Article
Text
id pubmed-6675125
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-66751252019-08-06 Ensemble of decision tree reveals potential miRNA-disease associations Chen, Xing Zhu, Chi-Chi Yin, Jun PLoS Comput Biol Research Article In recent years, increasing associations between microRNAs (miRNAs) and human diseases have been identified. Based on accumulating biological data, many computational models for potential miRNA-disease associations inference have been developed, which saves time and expenditure on experimental studies, making great contributions to researching molecular mechanism of human diseases and developing new drugs for disease treatment. In this paper, we proposed a novel computational method named Ensemble of Decision Tree based MiRNA-Disease Association prediction (EDTMDA), which innovatively built a computational framework integrating ensemble learning and dimensionality reduction. For each miRNA-disease pair, the feature vector was extracted by calculating the statistical measures, graph theoretical measures, and matrix factorization results for the miRNA and disease, respectively. Then multiple base learnings were built to yield many decision trees (DTs) based on random selection of negative samples and miRNA/disease features. Particularly, Principal Components Analysis was applied to each base learning to reduce feature dimensionality and hence remove the noise or redundancy. Average strategy was adopted for these DTs to get final association scores between miRNAs and diseases. In model performance evaluation, EDTMDA showed AUC of 0.9309 in global leave-one-out cross validation (LOOCV) and AUC of 0.8524 in local LOOCV. Additionally, AUC of 0.9192+/-0.0009 in 5-fold cross validation proved the model’s reliability and stability. Furthermore, three types of case studies for four human diseases were implemented. As a result, 94% (Esophageal Neoplasms), 86% (Kidney Neoplasms), 96% (Breast Neoplasms) and 88% (Carcinoma Hepatocellular) of top 50 predicted miRNAs were confirmed by experimental evidences in literature. Public Library of Science 2019-07-22 /pmc/articles/PMC6675125/ /pubmed/31329575 http://dx.doi.org/10.1371/journal.pcbi.1007209 Text en © 2019 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chen, Xing
Zhu, Chi-Chi
Yin, Jun
Ensemble of decision tree reveals potential miRNA-disease associations
title Ensemble of decision tree reveals potential miRNA-disease associations
title_full Ensemble of decision tree reveals potential miRNA-disease associations
title_fullStr Ensemble of decision tree reveals potential miRNA-disease associations
title_full_unstemmed Ensemble of decision tree reveals potential miRNA-disease associations
title_short Ensemble of decision tree reveals potential miRNA-disease associations
title_sort ensemble of decision tree reveals potential mirna-disease associations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6675125/
https://www.ncbi.nlm.nih.gov/pubmed/31329575
http://dx.doi.org/10.1371/journal.pcbi.1007209
work_keys_str_mv AT chenxing ensembleofdecisiontreerevealspotentialmirnadiseaseassociations
AT zhuchichi ensembleofdecisiontreerevealspotentialmirnadiseaseassociations
AT yinjun ensembleofdecisiontreerevealspotentialmirnadiseaseassociations