Cargando…

RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier

MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcom...

Descripción completa

Detalles Bibliográficos
Autores principales: Mendoza, Mariana R., da Fonseca, Guilherme C., Loss-Morais, Guilherme, Alves, Ronnie, Margis, Rogerio, Bazzan, Ana L. C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3724815/
https://www.ncbi.nlm.nih.gov/pubmed/23922946
http://dx.doi.org/10.1371/journal.pone.0070153
_version_ 1782476727619944448
author Mendoza, Mariana R.
da Fonseca, Guilherme C.
Loss-Morais, Guilherme
Alves, Ronnie
Margis, Rogerio
Bazzan, Ana L. C.
author_facet Mendoza, Mariana R.
da Fonseca, Guilherme C.
Loss-Morais, Guilherme
Alves, Ronnie
Margis, Rogerio
Bazzan, Ana L. C.
author_sort Mendoza, Mariana R.
collection PubMed
description MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment.
format Online
Article
Text
id pubmed-3724815
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37248152013-08-06 RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier Mendoza, Mariana R. da Fonseca, Guilherme C. Loss-Morais, Guilherme Alves, Ronnie Margis, Rogerio Bazzan, Ana L. C. PLoS One Research Article MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment. Public Library of Science 2013-07-26 /pmc/articles/PMC3724815/ /pubmed/23922946 http://dx.doi.org/10.1371/journal.pone.0070153 Text en © 2013 Mendoza et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mendoza, Mariana R.
da Fonseca, Guilherme C.
Loss-Morais, Guilherme
Alves, Ronnie
Margis, Rogerio
Bazzan, Ana L. C.
RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier
title RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier
title_full RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier
title_fullStr RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier
title_full_unstemmed RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier
title_short RFMirTarget: Predicting Human MicroRNA Target Genes with a Random Forest Classifier
title_sort rfmirtarget: predicting human microrna target genes with a random forest classifier
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3724815/
https://www.ncbi.nlm.nih.gov/pubmed/23922946
http://dx.doi.org/10.1371/journal.pone.0070153
work_keys_str_mv AT mendozamarianar rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier
AT dafonsecaguilhermec rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier
AT lossmoraisguilherme rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier
AT alvesronnie rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier
AT margisrogerio rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier
AT bazzananalc rfmirtargetpredictinghumanmicrornatargetgeneswitharandomforestclassifier