Cargando…

MaturePred: Efficient Identification of MicroRNAs within Novel Plant Pre-miRNAs

BACKGROUND: MicroRNAs (miRNAs) are a set of short (19∼24 nt) non-coding RNAs that play significant roles as posttranscriptional regulators in animals and plants. The ab initio prediction methods show excellent performance for discovering new pre-miRNAs. While most of these methods can distinguish re...

Descripción completa

Detalles Bibliográficos
Autores principales: Xuan, Ping, Guo, Maozu, Huang, Yangchao, Li, Wenbin, Huang, Yufei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3217989/
https://www.ncbi.nlm.nih.gov/pubmed/22110646
http://dx.doi.org/10.1371/journal.pone.0027422
_version_ 1782216647851900928
author Xuan, Ping
Guo, Maozu
Huang, Yangchao
Li, Wenbin
Huang, Yufei
author_facet Xuan, Ping
Guo, Maozu
Huang, Yangchao
Li, Wenbin
Huang, Yufei
author_sort Xuan, Ping
collection PubMed
description BACKGROUND: MicroRNAs (miRNAs) are a set of short (19∼24 nt) non-coding RNAs that play significant roles as posttranscriptional regulators in animals and plants. The ab initio prediction methods show excellent performance for discovering new pre-miRNAs. While most of these methods can distinguish real pre-miRNAs from pseudo pre-miRNAs, few can predict the positions of miRNAs. Among the existing methods that can also predict the miRNA positions, most of them are designed for mammalian miRNAs, including human and mouse. Minority of methods can predict the positions of plant miRNAs. Accurate prediction of the miRNA positions remains a challenge, especially for plant miRNAs. This motivates us to develop MaturePred, a machine learning method based on support vector machine, to predict the positions of plant miRNAs for the new plant pre-miRNA candidates. METHODOLOGY/PRINCIPAL FINDINGS: A miRNA:miRNA* duplex is regarded as a whole to capture the binding characteristics of miRNAs. We extract the position-specific features, the energy related features, the structure related features, and stability related features from real/pseudo miRNA:miRNA* duplexes. A set of informative features are selected to improve the prediction accuracy. Two-stage sample selection algorithm is proposed to combat the serious imbalance problem between real and pseudo miRNA:miRNA* duplexes. The prediction method, MaturePred, can accurately predict plant miRNAs and achieve higher prediction accuracy compared with the existing methods. Further, we trained a prediction model with animal data to predict animal miRNAs. The model also achieves higher prediction performance. It further confirms the efficiency of our miRNA prediction method. CONCLUSIONS: The superior performance of the proposed prediction model can be attributed to the extracted features of plant miRNAs and miRNA*s, the selected training dataset, and the carefully selected features. The web service of MaturePred, the training datasets, the testing datasets, and the selected features are freely available at http://nclab.hit.edu.cn/maturepred/.
format Online
Article
Text
id pubmed-3217989
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32179892011-11-21 MaturePred: Efficient Identification of MicroRNAs within Novel Plant Pre-miRNAs Xuan, Ping Guo, Maozu Huang, Yangchao Li, Wenbin Huang, Yufei PLoS One Research Article BACKGROUND: MicroRNAs (miRNAs) are a set of short (19∼24 nt) non-coding RNAs that play significant roles as posttranscriptional regulators in animals and plants. The ab initio prediction methods show excellent performance for discovering new pre-miRNAs. While most of these methods can distinguish real pre-miRNAs from pseudo pre-miRNAs, few can predict the positions of miRNAs. Among the existing methods that can also predict the miRNA positions, most of them are designed for mammalian miRNAs, including human and mouse. Minority of methods can predict the positions of plant miRNAs. Accurate prediction of the miRNA positions remains a challenge, especially for plant miRNAs. This motivates us to develop MaturePred, a machine learning method based on support vector machine, to predict the positions of plant miRNAs for the new plant pre-miRNA candidates. METHODOLOGY/PRINCIPAL FINDINGS: A miRNA:miRNA* duplex is regarded as a whole to capture the binding characteristics of miRNAs. We extract the position-specific features, the energy related features, the structure related features, and stability related features from real/pseudo miRNA:miRNA* duplexes. A set of informative features are selected to improve the prediction accuracy. Two-stage sample selection algorithm is proposed to combat the serious imbalance problem between real and pseudo miRNA:miRNA* duplexes. The prediction method, MaturePred, can accurately predict plant miRNAs and achieve higher prediction accuracy compared with the existing methods. Further, we trained a prediction model with animal data to predict animal miRNAs. The model also achieves higher prediction performance. It further confirms the efficiency of our miRNA prediction method. CONCLUSIONS: The superior performance of the proposed prediction model can be attributed to the extracted features of plant miRNAs and miRNA*s, the selected training dataset, and the carefully selected features. The web service of MaturePred, the training datasets, the testing datasets, and the selected features are freely available at http://nclab.hit.edu.cn/maturepred/. Public Library of Science 2011-11-16 /pmc/articles/PMC3217989/ /pubmed/22110646 http://dx.doi.org/10.1371/journal.pone.0027422 Text en Xuan et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Xuan, Ping
Guo, Maozu
Huang, Yangchao
Li, Wenbin
Huang, Yufei
MaturePred: Efficient Identification of MicroRNAs within Novel Plant Pre-miRNAs
title MaturePred: Efficient Identification of MicroRNAs within Novel Plant Pre-miRNAs
title_full MaturePred: Efficient Identification of MicroRNAs within Novel Plant Pre-miRNAs
title_fullStr MaturePred: Efficient Identification of MicroRNAs within Novel Plant Pre-miRNAs
title_full_unstemmed MaturePred: Efficient Identification of MicroRNAs within Novel Plant Pre-miRNAs
title_short MaturePred: Efficient Identification of MicroRNAs within Novel Plant Pre-miRNAs
title_sort maturepred: efficient identification of micrornas within novel plant pre-mirnas
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3217989/
https://www.ncbi.nlm.nih.gov/pubmed/22110646
http://dx.doi.org/10.1371/journal.pone.0027422
work_keys_str_mv AT xuanping maturepredefficientidentificationofmicrornaswithinnovelplantpremirnas
AT guomaozu maturepredefficientidentificationofmicrornaswithinnovelplantpremirnas
AT huangyangchao maturepredefficientidentificationofmicrornaswithinnovelplantpremirnas
AT liwenbin maturepredefficientidentificationofmicrornaswithinnovelplantpremirnas
AT huangyufei maturepredefficientidentificationofmicrornaswithinnovelplantpremirnas