Cargando…

miPIE: NGS-based Prediction of miRNA Using Integrated Evidence

Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-ba...

Descripción completa

Detalles Bibliográficos
Autores principales: Peace, R. J., Sheikh Hassani, M., Green, J. R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367335/
https://www.ncbi.nlm.nih.gov/pubmed/30733467
http://dx.doi.org/10.1038/s41598-018-38107-z
_version_ 1783393768939978752
author Peace, R. J.
Sheikh Hassani, M.
Green, J. R.
author_facet Peace, R. J.
Sheikh Hassani, M.
Green, J. R.
author_sort Peace, R. J.
collection PubMed
description Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-based patterns as well as advanced genomic sequence-based methods. While miRDeep2 does examine the predicted secondary structure of putative miRNA sequences, it does not leverage many of the sequence-based features used in state-of-the-art de novo methods. Meanwhile, other NGS-based methods, such as miRanalyzer, place an emphasis on sequence-based features without leveraging advanced expression-based features reflecting miRNA biosynthesis. This represents an opportunity to combine the strengths of NGS-based analysis with recent advances in de novo sequence-based miRNA prediction. We here develop a method, microRNA Prediction using Integrated Evidence (miPIE), which integrates both expression-based and sequence-based features to achieve significantly improved miRNA prediction performance. Feature selection identifies the 20 most discriminative features, 3 of which reflect strictly expression-based information. Evaluation using precision-recall curves, for six NGS data sets representing six diverse species, demonstrates substantial improvements in prediction performance compared to three methods: miRDeep2, miRanalyzer, and mirnovo. The individual contributions of expression-based and sequence-based features are also examined and we demonstrate that their combination is more effective than either alone.
format Online
Article
Text
id pubmed-6367335
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-63673352019-02-11 miPIE: NGS-based Prediction of miRNA Using Integrated Evidence Peace, R. J. Sheikh Hassani, M. Green, J. R. Sci Rep Article Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-based patterns as well as advanced genomic sequence-based methods. While miRDeep2 does examine the predicted secondary structure of putative miRNA sequences, it does not leverage many of the sequence-based features used in state-of-the-art de novo methods. Meanwhile, other NGS-based methods, such as miRanalyzer, place an emphasis on sequence-based features without leveraging advanced expression-based features reflecting miRNA biosynthesis. This represents an opportunity to combine the strengths of NGS-based analysis with recent advances in de novo sequence-based miRNA prediction. We here develop a method, microRNA Prediction using Integrated Evidence (miPIE), which integrates both expression-based and sequence-based features to achieve significantly improved miRNA prediction performance. Feature selection identifies the 20 most discriminative features, 3 of which reflect strictly expression-based information. Evaluation using precision-recall curves, for six NGS data sets representing six diverse species, demonstrates substantial improvements in prediction performance compared to three methods: miRDeep2, miRanalyzer, and mirnovo. The individual contributions of expression-based and sequence-based features are also examined and we demonstrate that their combination is more effective than either alone. Nature Publishing Group UK 2019-02-07 /pmc/articles/PMC6367335/ /pubmed/30733467 http://dx.doi.org/10.1038/s41598-018-38107-z Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Peace, R. J.
Sheikh Hassani, M.
Green, J. R.
miPIE: NGS-based Prediction of miRNA Using Integrated Evidence
title miPIE: NGS-based Prediction of miRNA Using Integrated Evidence
title_full miPIE: NGS-based Prediction of miRNA Using Integrated Evidence
title_fullStr miPIE: NGS-based Prediction of miRNA Using Integrated Evidence
title_full_unstemmed miPIE: NGS-based Prediction of miRNA Using Integrated Evidence
title_short miPIE: NGS-based Prediction of miRNA Using Integrated Evidence
title_sort mipie: ngs-based prediction of mirna using integrated evidence
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367335/
https://www.ncbi.nlm.nih.gov/pubmed/30733467
http://dx.doi.org/10.1038/s41598-018-38107-z
work_keys_str_mv AT peacerj mipiengsbasedpredictionofmirnausingintegratedevidence
AT sheikhhassanim mipiengsbasedpredictionofmirnausingintegratedevidence
AT greenjr mipiengsbasedpredictionofmirnausingintegratedevidence