Cargando…
RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier
MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcom...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3724815/ https://www.ncbi.nlm.nih.gov/pubmed/23922946 http://dx.doi.org/10.1371/journal.pone.0070153 |
_version_ | 1782476727619944448 |
---|---|
author | Mendoza, Mariana R. da Fonseca, Guilherme C. Loss-Morais, Guilherme Alves, Ronnie Margis, Rogerio Bazzan, Ana L. C. |
author_facet | Mendoza, Mariana R. da Fonseca, Guilherme C. Loss-Morais, Guilherme Alves, Ronnie Margis, Rogerio Bazzan, Ana L. C. |
author_sort | Mendoza, Mariana R. |
collection | PubMed |
description | MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment. |
format | Online Article Text |
id | pubmed-3724815 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-37248152013-08-06 RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier Mendoza, Mariana R. da Fonseca, Guilherme C. Loss-Morais, Guilherme Alves, Ronnie Margis, Rogerio Bazzan, Ana L. C. PLoS One Research Article MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment. Public Library of Science 2013-07-26 /pmc/articles/PMC3724815/ /pubmed/23922946 http://dx.doi.org/10.1371/journal.pone.0070153 Text en © 2013 Mendoza et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Mendoza, Mariana R. da Fonseca, Guilherme C. Loss-Morais, Guilherme Alves, Ronnie Margis, Rogerio Bazzan, Ana L. C. RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier |
title | RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier |
title_full | RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier |
title_fullStr | RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier |
title_full_unstemmed | RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier |
title_short | RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier |
title_sort | rfmirtarget: predicting human microrna target genes with a random forest classifier |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3724815/ https://www.ncbi.nlm.nih.gov/pubmed/23922946 http://dx.doi.org/10.1371/journal.pone.0070153 |
work_keys_str_mv | AT mendozamarianar rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier AT dafonsecaguilhermec rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier AT lossmoraisguilherme rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier AT alvesronnie rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier AT margisrogerio rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier AT bazzananalc rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier |