Cargando…
MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features
To distinguish the real pre-miRNAs from other hairpin sequences with similar stem-loops (pseudo pre-miRNAs), a hybrid feature which consists of local contiguous structure-sequence composition, minimum of free energy (MFE) of the secondary structure and P-value of randomization test is used. Besides,...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933124/ https://www.ncbi.nlm.nih.gov/pubmed/17553836 http://dx.doi.org/10.1093/nar/gkm368 |
_version_ | 1782134291162988544 |
---|---|
author | Jiang, Peng Wu, Haonan Wang, Wenkai Ma, Wei Sun, Xiao Lu, Zuhong |
author_facet | Jiang, Peng Wu, Haonan Wang, Wenkai Ma, Wei Sun, Xiao Lu, Zuhong |
author_sort | Jiang, Peng |
collection | PubMed |
description | To distinguish the real pre-miRNAs from other hairpin sequences with similar stem-loops (pseudo pre-miRNAs), a hybrid feature which consists of local contiguous structure-sequence composition, minimum of free energy (MFE) of the secondary structure and P-value of randomization test is used. Besides, a novel machine-learning algorithm, random forest (RF), is introduced. The results suggest that our method predicts at 98.21% specificity and 95.09% sensitivity. When compared with the previous study, Triplet-SVM-classifier, our RF method was nearly 10% greater in total accuracy. Further analysis indicated that the improvement was due to both the combined features and the RF algorithm. The MiPred web server is available at http://www.bioinf.seu.edu.cn/miRNA/. Given a sequence, MiPred decides whether it is a pre-miRNA-like hairpin sequence or not. If the sequence is a pre-miRNA-like hairpin, the RF classifier will predict whether it is a real pre-miRNA or a pseudo one. |
format | Text |
id | pubmed-1933124 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-19331242007-07-31 MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features Jiang, Peng Wu, Haonan Wang, Wenkai Ma, Wei Sun, Xiao Lu, Zuhong Nucleic Acids Res Articles To distinguish the real pre-miRNAs from other hairpin sequences with similar stem-loops (pseudo pre-miRNAs), a hybrid feature which consists of local contiguous structure-sequence composition, minimum of free energy (MFE) of the secondary structure and P-value of randomization test is used. Besides, a novel machine-learning algorithm, random forest (RF), is introduced. The results suggest that our method predicts at 98.21% specificity and 95.09% sensitivity. When compared with the previous study, Triplet-SVM-classifier, our RF method was nearly 10% greater in total accuracy. Further analysis indicated that the improvement was due to both the combined features and the RF algorithm. The MiPred web server is available at http://www.bioinf.seu.edu.cn/miRNA/. Given a sequence, MiPred decides whether it is a pre-miRNA-like hairpin sequence or not. If the sequence is a pre-miRNA-like hairpin, the RF classifier will predict whether it is a real pre-miRNA or a pseudo one. Oxford University Press 2007-07 2007-06-06 /pmc/articles/PMC1933124/ /pubmed/17553836 http://dx.doi.org/10.1093/nar/gkm368 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Jiang, Peng Wu, Haonan Wang, Wenkai Ma, Wei Sun, Xiao Lu, Zuhong MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features |
title | MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features |
title_full | MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features |
title_fullStr | MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features |
title_full_unstemmed | MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features |
title_short | MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features |
title_sort | mipred: classification of real and pseudo microrna precursors using random forest prediction model with combined features |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933124/ https://www.ncbi.nlm.nih.gov/pubmed/17553836 http://dx.doi.org/10.1093/nar/gkm368 |
work_keys_str_mv | AT jiangpeng mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures AT wuhaonan mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures AT wangwenkai mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures AT mawei mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures AT sunxiao mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures AT luzuhong mipredclassificationofrealandpseudomicrornaprecursorsusingrandomforestpredictionmodelwithcombinedfeatures |