Cargando…

Prediction of MicroRNA Precursors Using Parsimonious Feature Sets

MicroRNAs (miRNAs) are a class of short noncoding RNAs that regulate gene expression through base pairing with messenger RNAs. Due to the interest in studying miRNA dysregulation in disease and limits of validated miRNA references, identification of novel miRNAs is a critical task. The performance o...

Descripción completa

Detalles Bibliográficos
Autores principales: Stepanowsky, Petra, Levy, Eric, Kim, Jihoon, Jiang, Xiaoqian, Ohno-Machado, Lucila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216048/
https://www.ncbi.nlm.nih.gov/pubmed/25392687
http://dx.doi.org/10.4137/CIN.S13877
_version_ 1782342201517277184
author Stepanowsky, Petra
Levy, Eric
Kim, Jihoon
Jiang, Xiaoqian
Ohno-Machado, Lucila
author_facet Stepanowsky, Petra
Levy, Eric
Kim, Jihoon
Jiang, Xiaoqian
Ohno-Machado, Lucila
author_sort Stepanowsky, Petra
collection PubMed
description MicroRNAs (miRNAs) are a class of short noncoding RNAs that regulate gene expression through base pairing with messenger RNAs. Due to the interest in studying miRNA dysregulation in disease and limits of validated miRNA references, identification of novel miRNAs is a critical task. The performance of different models to predict novel miRNAs varies with the features chosen as predictors. However, no study has systematically compared published feature sets. We constructed a comprehensive feature set using the minimum free energy of the secondary structure of precursor miRNAs, a set of nucleotide-structure triplets, and additional extracted sequence and structure characteristics. We then compared the predictive value of our comprehensive feature set to those from three previously published studies, using logistic regression and random forest classifiers. We found that classifiers containing as few as seven highly predictive features are able to predict novel precursor miRNAs as well as classifiers that use larger feature sets. In a real data set, our method correctly identified the holdout miRNAs relevant to renal cancer.
format Online
Article
Text
id pubmed-4216048
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-42160482014-11-12 Prediction of MicroRNA Precursors Using Parsimonious Feature Sets Stepanowsky, Petra Levy, Eric Kim, Jihoon Jiang, Xiaoqian Ohno-Machado, Lucila Cancer Inform Review MicroRNAs (miRNAs) are a class of short noncoding RNAs that regulate gene expression through base pairing with messenger RNAs. Due to the interest in studying miRNA dysregulation in disease and limits of validated miRNA references, identification of novel miRNAs is a critical task. The performance of different models to predict novel miRNAs varies with the features chosen as predictors. However, no study has systematically compared published feature sets. We constructed a comprehensive feature set using the minimum free energy of the secondary structure of precursor miRNAs, a set of nucleotide-structure triplets, and additional extracted sequence and structure characteristics. We then compared the predictive value of our comprehensive feature set to those from three previously published studies, using logistic regression and random forest classifiers. We found that classifiers containing as few as seven highly predictive features are able to predict novel precursor miRNAs as well as classifiers that use larger feature sets. In a real data set, our method correctly identified the holdout miRNAs relevant to renal cancer. Libertas Academica 2014-10-14 /pmc/articles/PMC4216048/ /pubmed/25392687 http://dx.doi.org/10.4137/CIN.S13877 Text en © 2014 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License.
spellingShingle Review
Stepanowsky, Petra
Levy, Eric
Kim, Jihoon
Jiang, Xiaoqian
Ohno-Machado, Lucila
Prediction of MicroRNA Precursors Using Parsimonious Feature Sets
title Prediction of MicroRNA Precursors Using Parsimonious Feature Sets
title_full Prediction of MicroRNA Precursors Using Parsimonious Feature Sets
title_fullStr Prediction of MicroRNA Precursors Using Parsimonious Feature Sets
title_full_unstemmed Prediction of MicroRNA Precursors Using Parsimonious Feature Sets
title_short Prediction of MicroRNA Precursors Using Parsimonious Feature Sets
title_sort prediction of microrna precursors using parsimonious feature sets
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216048/
https://www.ncbi.nlm.nih.gov/pubmed/25392687
http://dx.doi.org/10.4137/CIN.S13877
work_keys_str_mv AT stepanowskypetra predictionofmicrornaprecursorsusingparsimoniousfeaturesets
AT levyeric predictionofmicrornaprecursorsusingparsimoniousfeaturesets
AT kimjihoon predictionofmicrornaprecursorsusingparsimoniousfeaturesets
AT jiangxiaoqian predictionofmicrornaprecursorsusingparsimoniousfeaturesets
AT ohnomachadolucila predictionofmicrornaprecursorsusingparsimoniousfeaturesets