Cargando…

Prediction of Mature MicroRNA and Piwi-Interacting RNA without a Genome Reference or Precursors

The discovery of novel microRNA (miRNA) and piwi-interacting RNA (piRNA) is an important task for the understanding of many biological processes. Most of the available miRNA and piRNA identification methods are dependent on the availability of the organism’s genome sequence and the quality of its an...

Descripción completa

Detalles Bibliográficos
Autores principales: Menor, Mark S., Baek, Kyungim, Poisson, Guylaine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4307313/
https://www.ncbi.nlm.nih.gov/pubmed/25580537
http://dx.doi.org/10.3390/ijms16011466
_version_ 1782354445115326464
author Menor, Mark S.
Baek, Kyungim
Poisson, Guylaine
author_facet Menor, Mark S.
Baek, Kyungim
Poisson, Guylaine
author_sort Menor, Mark S.
collection PubMed
description The discovery of novel microRNA (miRNA) and piwi-interacting RNA (piRNA) is an important task for the understanding of many biological processes. Most of the available miRNA and piRNA identification methods are dependent on the availability of the organism’s genome sequence and the quality of its annotation. Therefore, an efficient prediction method based solely on the short RNA reads and requiring no genomic information is highly desirable. In this study, we propose an approach that relies primarily on the nucleotide composition of the read and does not require reference genomes of related species for prediction. Using an empirical Bayesian kernel method and the error correcting output codes framework, compact models suitable for large-scale analyses are built on databases of known mature miRNAs and piRNAs. We found that the usage of an L(1)-based Gaussian kernel can double the true positive rate compared to the standard L(2)-based Gaussian kernel. Our approach can increase the true positive rate by at most 60% compared to the existing piRNA predictor based on the analysis of a hold-out test set. Using experimental data, we also show that our approach can detect about an order of magnitude or more known miRNAs than the mature miRNA predictor, miRPlex.
format Online
Article
Text
id pubmed-4307313
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-43073132015-02-02 Prediction of Mature MicroRNA and Piwi-Interacting RNA without a Genome Reference or Precursors Menor, Mark S. Baek, Kyungim Poisson, Guylaine Int J Mol Sci Article The discovery of novel microRNA (miRNA) and piwi-interacting RNA (piRNA) is an important task for the understanding of many biological processes. Most of the available miRNA and piRNA identification methods are dependent on the availability of the organism’s genome sequence and the quality of its annotation. Therefore, an efficient prediction method based solely on the short RNA reads and requiring no genomic information is highly desirable. In this study, we propose an approach that relies primarily on the nucleotide composition of the read and does not require reference genomes of related species for prediction. Using an empirical Bayesian kernel method and the error correcting output codes framework, compact models suitable for large-scale analyses are built on databases of known mature miRNAs and piRNAs. We found that the usage of an L(1)-based Gaussian kernel can double the true positive rate compared to the standard L(2)-based Gaussian kernel. Our approach can increase the true positive rate by at most 60% compared to the existing piRNA predictor based on the analysis of a hold-out test set. Using experimental data, we also show that our approach can detect about an order of magnitude or more known miRNAs than the mature miRNA predictor, miRPlex. MDPI 2015-01-08 /pmc/articles/PMC4307313/ /pubmed/25580537 http://dx.doi.org/10.3390/ijms16011466 Text en © 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Menor, Mark S.
Baek, Kyungim
Poisson, Guylaine
Prediction of Mature MicroRNA and Piwi-Interacting RNA without a Genome Reference or Precursors
title Prediction of Mature MicroRNA and Piwi-Interacting RNA without a Genome Reference or Precursors
title_full Prediction of Mature MicroRNA and Piwi-Interacting RNA without a Genome Reference or Precursors
title_fullStr Prediction of Mature MicroRNA and Piwi-Interacting RNA without a Genome Reference or Precursors
title_full_unstemmed Prediction of Mature MicroRNA and Piwi-Interacting RNA without a Genome Reference or Precursors
title_short Prediction of Mature MicroRNA and Piwi-Interacting RNA without a Genome Reference or Precursors
title_sort prediction of mature microrna and piwi-interacting rna without a genome reference or precursors
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4307313/
https://www.ncbi.nlm.nih.gov/pubmed/25580537
http://dx.doi.org/10.3390/ijms16011466
work_keys_str_mv AT menormarks predictionofmaturemicrornaandpiwiinteractingrnawithoutagenomereferenceorprecursors
AT baekkyungim predictionofmaturemicrornaandpiwiinteractingrnawithoutagenomereferenceorprecursors
AT poissonguylaine predictionofmaturemicrornaandpiwiinteractingrnawithoutagenomereferenceorprecursors