Cargando…

MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features

To distinguish the real pre-miRNAs from other hairpin sequences with similar stem-loops (pseudo pre-miRNAs), a hybrid feature which consists of local contiguous structure-sequence composition, minimum of free energy (MFE) of the secondary structure and P-value of randomization test is used. Besides,...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Peng, Wu, Haonan, Wang, Wenkai, Ma, Wei, Sun, Xiao, Lu, Zuhong
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933124/
https://www.ncbi.nlm.nih.gov/pubmed/17553836
http://dx.doi.org/10.1093/nar/gkm368
_version_ 1782134291162988544
author Jiang, Peng
Wu, Haonan
Wang, Wenkai
Ma, Wei
Sun, Xiao
Lu, Zuhong
author_facet Jiang, Peng
Wu, Haonan
Wang, Wenkai
Ma, Wei
Sun, Xiao
Lu, Zuhong
author_sort Jiang, Peng
collection PubMed
description To distinguish the real pre-miRNAs from other hairpin sequences with similar stem-loops (pseudo pre-miRNAs), a hybrid feature which consists of local contiguous structure-sequence composition, minimum of free energy (MFE) of the secondary structure and P-value of randomization test is used. Besides, a novel machine-learning algorithm, random forest (RF), is introduced. The results suggest that our method predicts at 98.21% specificity and 95.09% sensitivity. When compared with the previous study, Triplet-SVM-classifier, our RF method was nearly 10% greater in total accuracy. Further analysis indicated that the improvement was due to both the combined features and the RF algorithm. The MiPred web server is available at http://www.bioinf.seu.edu.cn/miRNA/. Given a sequence, MiPred decides whether it is a pre-miRNA-like hairpin sequence or not. If the sequence is a pre-miRNA-like hairpin, the RF classifier will predict whether it is a real pre-miRNA or a pseudo one.
format Text
id pubmed-1933124
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-19331242007-07-31 MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features Jiang, Peng Wu, Haonan Wang, Wenkai Ma, Wei Sun, Xiao Lu, Zuhong Nucleic Acids Res Articles To distinguish the real pre-miRNAs from other hairpin sequences with similar stem-loops (pseudo pre-miRNAs), a hybrid feature which consists of local contiguous structure-sequence composition, minimum of free energy (MFE) of the secondary structure and P-value of randomization test is used. Besides, a novel machine-learning algorithm, random forest (RF), is introduced. The results suggest that our method predicts at 98.21% specificity and 95.09% sensitivity. When compared with the previous study, Triplet-SVM-classifier, our RF method was nearly 10% greater in total accuracy. Further analysis indicated that the improvement was due to both the combined features and the RF algorithm. The MiPred web server is available at http://www.bioinf.seu.edu.cn/miRNA/. Given a sequence, MiPred decides whether it is a pre-miRNA-like hairpin sequence or not. If the sequence is a pre-miRNA-like hairpin, the RF classifier will predict whether it is a real pre-miRNA or a pseudo one. Oxford University Press 2007-07 2007-06-06 /pmc/articles/PMC1933124/ /pubmed/17553836 http://dx.doi.org/10.1093/nar/gkm368 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Jiang, Peng
Wu, Haonan
Wang, Wenkai
Ma, Wei
Sun, Xiao
Lu, Zuhong
MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features
title MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features
title_full MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features
title_fullStr MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features
title_full_unstemmed MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features
title_short MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features
title_sort mipred: classification of real and pseudo microrna precursors using random forest prediction model with combined features
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933124/
https://www.ncbi.nlm.nih.gov/pubmed/17553836
http://dx.doi.org/10.1093/nar/gkm368
work_keys_str_mv AT jiangpeng mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures
AT wuhaonan mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures
AT wangwenkai mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures
AT mawei mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures
AT sunxiao mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures
AT luzuhong mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures