Cargando…

Ensemble Methods for MiRNA Target Prediction from Expression Data

BACKGROUND: microRNAs (miRNAs) are short regulatory RNAs that are involved in several diseases, including cancers. Identifying miRNA functions is very important in understanding disease mechanisms and determining the efficacy of drugs. An increasing number of computational methods have been develope...

Descripción completa

Detalles Bibliográficos
Autores principales: Le, Thuc Duy, Zhang, Junpeng, Liu, Lin, Li, Jiuyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4482624/
https://www.ncbi.nlm.nih.gov/pubmed/26114448
http://dx.doi.org/10.1371/journal.pone.0131627
_version_ 1782378473443033088
author Le, Thuc Duy
Zhang, Junpeng
Liu, Lin
Li, Jiuyong
author_facet Le, Thuc Duy
Zhang, Junpeng
Liu, Lin
Li, Jiuyong
author_sort Le, Thuc Duy
collection PubMed
description BACKGROUND: microRNAs (miRNAs) are short regulatory RNAs that are involved in several diseases, including cancers. Identifying miRNA functions is very important in understanding disease mechanisms and determining the efficacy of drugs. An increasing number of computational methods have been developed to explore miRNA functions by inferring the miRNA-mRNA regulatory relationships from data. Each of the methods is developed based on some assumptions and constraints, for instance, assuming linear relationships between variables. For such reasons, computational methods are often subject to the problem of inconsistent performance across different datasets. On the other hand, ensemble methods integrate the results from individual methods and have been proved to outperform each of their individual component methods in theory. RESULTS: In this paper, we investigate the performance of some ensemble methods over the commonly used miRNA target prediction methods. We apply eight different popular miRNA target prediction methods to three cancer datasets, and compare their performance with the ensemble methods which integrate the results from each combination of the individual methods. The validation results using experimentally confirmed databases show that the results of the ensemble methods complement those obtained by the individual methods and the ensemble methods perform better than the individual methods across different datasets. The ensemble method, Pearson+IDA+Lasso, which combines methods in different approaches, including a correlation method, a causal inference method, and a regression method, is the best performed ensemble method in this study. Further analysis of the results of this ensemble method shows that the ensemble method can obtain more targets which could not be found by any of the single methods, and the discovered targets are more statistically significant and functionally enriched. The source codes, datasets, miRNA target predictions by all methods, and the ground truth for validation are available in the Supplementary materials.
format Online
Article
Text
id pubmed-4482624
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44826242015-06-29 Ensemble Methods for MiRNA Target Prediction from Expression Data Le, Thuc Duy Zhang, Junpeng Liu, Lin Li, Jiuyong PLoS One Research Article BACKGROUND: microRNAs (miRNAs) are short regulatory RNAs that are involved in several diseases, including cancers. Identifying miRNA functions is very important in understanding disease mechanisms and determining the efficacy of drugs. An increasing number of computational methods have been developed to explore miRNA functions by inferring the miRNA-mRNA regulatory relationships from data. Each of the methods is developed based on some assumptions and constraints, for instance, assuming linear relationships between variables. For such reasons, computational methods are often subject to the problem of inconsistent performance across different datasets. On the other hand, ensemble methods integrate the results from individual methods and have been proved to outperform each of their individual component methods in theory. RESULTS: In this paper, we investigate the performance of some ensemble methods over the commonly used miRNA target prediction methods. We apply eight different popular miRNA target prediction methods to three cancer datasets, and compare their performance with the ensemble methods which integrate the results from each combination of the individual methods. The validation results using experimentally confirmed databases show that the results of the ensemble methods complement those obtained by the individual methods and the ensemble methods perform better than the individual methods across different datasets. The ensemble method, Pearson+IDA+Lasso, which combines methods in different approaches, including a correlation method, a causal inference method, and a regression method, is the best performed ensemble method in this study. Further analysis of the results of this ensemble method shows that the ensemble method can obtain more targets which could not be found by any of the single methods, and the discovered targets are more statistically significant and functionally enriched. The source codes, datasets, miRNA target predictions by all methods, and the ground truth for validation are available in the Supplementary materials. Public Library of Science 2015-06-26 /pmc/articles/PMC4482624/ /pubmed/26114448 http://dx.doi.org/10.1371/journal.pone.0131627 Text en © 2015 Le et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Le, Thuc Duy
Zhang, Junpeng
Liu, Lin
Li, Jiuyong
Ensemble Methods for MiRNA Target Prediction from Expression Data
title Ensemble Methods for MiRNA Target Prediction from Expression Data
title_full Ensemble Methods for MiRNA Target Prediction from Expression Data
title_fullStr Ensemble Methods for MiRNA Target Prediction from Expression Data
title_full_unstemmed Ensemble Methods for MiRNA Target Prediction from Expression Data
title_short Ensemble Methods for MiRNA Target Prediction from Expression Data
title_sort ensemble methods for mirna target prediction from expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4482624/
https://www.ncbi.nlm.nih.gov/pubmed/26114448
http://dx.doi.org/10.1371/journal.pone.0131627
work_keys_str_mv AT lethucduy ensemblemethodsformirnatargetpredictionfromexpressiondata
AT zhangjunpeng ensemblemethodsformirnatargetpredictionfromexpressiondata
AT liulin ensemblemethodsformirnatargetpredictionfromexpressiondata
AT lijiuyong ensemblemethodsformirnatargetpredictionfromexpressiondata