Cargando…
Prediction of MicroRNA Precursors Using Parsimonious Feature Sets
MicroRNAs (miRNAs) are a class of short noncoding RNAs that regulate gene expression through base pairing with messenger RNAs. Due to the interest in studying miRNA dysregulation in disease and limits of validated miRNA references, identification of novel miRNAs is a critical task. The performance o...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216048/ https://www.ncbi.nlm.nih.gov/pubmed/25392687 http://dx.doi.org/10.4137/CIN.S13877 |
_version_ | 1782342201517277184 |
---|---|
author | Stepanowsky, Petra Levy, Eric Kim, Jihoon Jiang, Xiaoqian Ohno-Machado, Lucila |
author_facet | Stepanowsky, Petra Levy, Eric Kim, Jihoon Jiang, Xiaoqian Ohno-Machado, Lucila |
author_sort | Stepanowsky, Petra |
collection | PubMed |
description | MicroRNAs (miRNAs) are a class of short noncoding RNAs that regulate gene expression through base pairing with messenger RNAs. Due to the interest in studying miRNA dysregulation in disease and limits of validated miRNA references, identification of novel miRNAs is a critical task. The performance of different models to predict novel miRNAs varies with the features chosen as predictors. However, no study has systematically compared published feature sets. We constructed a comprehensive feature set using the minimum free energy of the secondary structure of precursor miRNAs, a set of nucleotide-structure triplets, and additional extracted sequence and structure characteristics. We then compared the predictive value of our comprehensive feature set to those from three previously published studies, using logistic regression and random forest classifiers. We found that classifiers containing as few as seven highly predictive features are able to predict novel precursor miRNAs as well as classifiers that use larger feature sets. In a real data set, our method correctly identified the holdout miRNAs relevant to renal cancer. |
format | Online Article Text |
id | pubmed-4216048 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-42160482014-11-12 Prediction of MicroRNA Precursors Using Parsimonious Feature Sets Stepanowsky, Petra Levy, Eric Kim, Jihoon Jiang, Xiaoqian Ohno-Machado, Lucila Cancer Inform Review MicroRNAs (miRNAs) are a class of short noncoding RNAs that regulate gene expression through base pairing with messenger RNAs. Due to the interest in studying miRNA dysregulation in disease and limits of validated miRNA references, identification of novel miRNAs is a critical task. The performance of different models to predict novel miRNAs varies with the features chosen as predictors. However, no study has systematically compared published feature sets. We constructed a comprehensive feature set using the minimum free energy of the secondary structure of precursor miRNAs, a set of nucleotide-structure triplets, and additional extracted sequence and structure characteristics. We then compared the predictive value of our comprehensive feature set to those from three previously published studies, using logistic regression and random forest classifiers. We found that classifiers containing as few as seven highly predictive features are able to predict novel precursor miRNAs as well as classifiers that use larger feature sets. In a real data set, our method correctly identified the holdout miRNAs relevant to renal cancer. Libertas Academica 2014-10-14 /pmc/articles/PMC4216048/ /pubmed/25392687 http://dx.doi.org/10.4137/CIN.S13877 Text en © 2014 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License. |
spellingShingle | Review Stepanowsky, Petra Levy, Eric Kim, Jihoon Jiang, Xiaoqian Ohno-Machado, Lucila Prediction of MicroRNA Precursors Using Parsimonious Feature Sets |
title | Prediction of MicroRNA Precursors Using Parsimonious Feature Sets |
title_full | Prediction of MicroRNA Precursors Using Parsimonious Feature Sets |
title_fullStr | Prediction of MicroRNA Precursors Using Parsimonious Feature Sets |
title_full_unstemmed | Prediction of MicroRNA Precursors Using Parsimonious Feature Sets |
title_short | Prediction of MicroRNA Precursors Using Parsimonious Feature Sets |
title_sort | prediction of microrna precursors using parsimonious feature sets |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216048/ https://www.ncbi.nlm.nih.gov/pubmed/25392687 http://dx.doi.org/10.4137/CIN.S13877 |
work_keys_str_mv | AT stepanowskypetra predictionofmicrornaprecursorsusingparsimoniousfeaturesets AT levyeric predictionofmicrornaprecursorsusingparsimoniousfeaturesets AT kimjihoon predictionofmicrornaprecursorsusingparsimoniousfeaturesets AT jiangxiaoqian predictionofmicrornaprecursorsusingparsimoniousfeaturesets AT ohnomachadolucila predictionofmicrornaprecursorsusingparsimoniousfeaturesets |