Cargando…

miRLocator: Machine Learning-Based Prediction of Mature MicroRNAs within Plant Pre-miRNA Sequences

MicroRNAs (miRNAs) are a class of short, non-coding RNA that play regulatory roles in a wide variety of biological processes, such as plant growth and abiotic stress responses. Although several computational tools have been developed to identify primary miRNAs and precursor miRNAs (pre-miRNAs), very...

Descripción completa

Detalles Bibliográficos
Autores principales: Cui, Haibo, Zhai, Jingjing, Ma, Chuang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4641693/
https://www.ncbi.nlm.nih.gov/pubmed/26558614
http://dx.doi.org/10.1371/journal.pone.0142753
_version_ 1782400240292200448
author Cui, Haibo
Zhai, Jingjing
Ma, Chuang
author_facet Cui, Haibo
Zhai, Jingjing
Ma, Chuang
author_sort Cui, Haibo
collection PubMed
description MicroRNAs (miRNAs) are a class of short, non-coding RNA that play regulatory roles in a wide variety of biological processes, such as plant growth and abiotic stress responses. Although several computational tools have been developed to identify primary miRNAs and precursor miRNAs (pre-miRNAs), very few provide the functionality of locating mature miRNAs within plant pre-miRNAs. This manuscript introduces a novel algorithm for predicting miRNAs named miRLocator, which isbased on machine learning techniques and sequence and structural features extracted from miRNA:miRNA* duplexes. To address the class imbalance problem (few real miRNAs and a large number of pseudo miRNAs), the prediction models in miRLocator were optimized by considering critical (and often ignored) factors that can markedly affect the prediction accuracy of mature miRNAs, including the machine learning algorithm and the ratio between training positive and negative samples. Ten-fold cross-validation on 5854 experimentally validated miRNAs from 19 plant species showed that miRLocator performed better than the state-of-art miRNA predictor miRdup in locating mature miRNAs within plant pre-miRNAs. miRLocator will aid researchers interested in discovering miRNAs from model and non-model plant species.
format Online
Article
Text
id pubmed-4641693
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46416932015-11-18 miRLocator: Machine Learning-Based Prediction of Mature MicroRNAs within Plant Pre-miRNA Sequences Cui, Haibo Zhai, Jingjing Ma, Chuang PLoS One Research Article MicroRNAs (miRNAs) are a class of short, non-coding RNA that play regulatory roles in a wide variety of biological processes, such as plant growth and abiotic stress responses. Although several computational tools have been developed to identify primary miRNAs and precursor miRNAs (pre-miRNAs), very few provide the functionality of locating mature miRNAs within plant pre-miRNAs. This manuscript introduces a novel algorithm for predicting miRNAs named miRLocator, which isbased on machine learning techniques and sequence and structural features extracted from miRNA:miRNA* duplexes. To address the class imbalance problem (few real miRNAs and a large number of pseudo miRNAs), the prediction models in miRLocator were optimized by considering critical (and often ignored) factors that can markedly affect the prediction accuracy of mature miRNAs, including the machine learning algorithm and the ratio between training positive and negative samples. Ten-fold cross-validation on 5854 experimentally validated miRNAs from 19 plant species showed that miRLocator performed better than the state-of-art miRNA predictor miRdup in locating mature miRNAs within plant pre-miRNAs. miRLocator will aid researchers interested in discovering miRNAs from model and non-model plant species. Public Library of Science 2015-11-11 /pmc/articles/PMC4641693/ /pubmed/26558614 http://dx.doi.org/10.1371/journal.pone.0142753 Text en © 2015 Cui et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Cui, Haibo
Zhai, Jingjing
Ma, Chuang
miRLocator: Machine Learning-Based Prediction of Mature MicroRNAs within Plant Pre-miRNA Sequences
title miRLocator: Machine Learning-Based Prediction of Mature MicroRNAs within Plant Pre-miRNA Sequences
title_full miRLocator: Machine Learning-Based Prediction of Mature MicroRNAs within Plant Pre-miRNA Sequences
title_fullStr miRLocator: Machine Learning-Based Prediction of Mature MicroRNAs within Plant Pre-miRNA Sequences
title_full_unstemmed miRLocator: Machine Learning-Based Prediction of Mature MicroRNAs within Plant Pre-miRNA Sequences
title_short miRLocator: Machine Learning-Based Prediction of Mature MicroRNAs within Plant Pre-miRNA Sequences
title_sort mirlocator: machine learning-based prediction of mature micrornas within plant pre-mirna sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4641693/
https://www.ncbi.nlm.nih.gov/pubmed/26558614
http://dx.doi.org/10.1371/journal.pone.0142753
work_keys_str_mv AT cuihaibo mirlocatormachinelearningbasedpredictionofmaturemicrornaswithinplantpremirnasequences
AT zhaijingjing mirlocatormachinelearningbasedpredictionofmaturemicrornaswithinplantpremirnasequences
AT machuang mirlocatormachinelearningbasedpredictionofmaturemicrornaswithinplantpremirnasequences