Cargando…
miPIE: NGS-based Prediction of miRNA Using Integrated Evidence
Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-ba...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367335/ https://www.ncbi.nlm.nih.gov/pubmed/30733467 http://dx.doi.org/10.1038/s41598-018-38107-z |
_version_ | 1783393768939978752 |
---|---|
author | Peace, R. J. Sheikh Hassani, M. Green, J. R. |
author_facet | Peace, R. J. Sheikh Hassani, M. Green, J. R. |
author_sort | Peace, R. J. |
collection | PubMed |
description | Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-based patterns as well as advanced genomic sequence-based methods. While miRDeep2 does examine the predicted secondary structure of putative miRNA sequences, it does not leverage many of the sequence-based features used in state-of-the-art de novo methods. Meanwhile, other NGS-based methods, such as miRanalyzer, place an emphasis on sequence-based features without leveraging advanced expression-based features reflecting miRNA biosynthesis. This represents an opportunity to combine the strengths of NGS-based analysis with recent advances in de novo sequence-based miRNA prediction. We here develop a method, microRNA Prediction using Integrated Evidence (miPIE), which integrates both expression-based and sequence-based features to achieve significantly improved miRNA prediction performance. Feature selection identifies the 20 most discriminative features, 3 of which reflect strictly expression-based information. Evaluation using precision-recall curves, for six NGS data sets representing six diverse species, demonstrates substantial improvements in prediction performance compared to three methods: miRDeep2, miRanalyzer, and mirnovo. The individual contributions of expression-based and sequence-based features are also examined and we demonstrate that their combination is more effective than either alone. |
format | Online Article Text |
id | pubmed-6367335 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-63673352019-02-11 miPIE: NGS-based Prediction of miRNA Using Integrated Evidence Peace, R. J. Sheikh Hassani, M. Green, J. R. Sci Rep Article Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-based patterns as well as advanced genomic sequence-based methods. While miRDeep2 does examine the predicted secondary structure of putative miRNA sequences, it does not leverage many of the sequence-based features used in state-of-the-art de novo methods. Meanwhile, other NGS-based methods, such as miRanalyzer, place an emphasis on sequence-based features without leveraging advanced expression-based features reflecting miRNA biosynthesis. This represents an opportunity to combine the strengths of NGS-based analysis with recent advances in de novo sequence-based miRNA prediction. We here develop a method, microRNA Prediction using Integrated Evidence (miPIE), which integrates both expression-based and sequence-based features to achieve significantly improved miRNA prediction performance. Feature selection identifies the 20 most discriminative features, 3 of which reflect strictly expression-based information. Evaluation using precision-recall curves, for six NGS data sets representing six diverse species, demonstrates substantial improvements in prediction performance compared to three methods: miRDeep2, miRanalyzer, and mirnovo. The individual contributions of expression-based and sequence-based features are also examined and we demonstrate that their combination is more effective than either alone. Nature Publishing Group UK 2019-02-07 /pmc/articles/PMC6367335/ /pubmed/30733467 http://dx.doi.org/10.1038/s41598-018-38107-z Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Peace, R. J. Sheikh Hassani, M. Green, J. R. miPIE: NGS-based Prediction of miRNA Using Integrated Evidence |
title | miPIE: NGS-based Prediction of miRNA Using Integrated Evidence |
title_full | miPIE: NGS-based Prediction of miRNA Using Integrated Evidence |
title_fullStr | miPIE: NGS-based Prediction of miRNA Using Integrated Evidence |
title_full_unstemmed | miPIE: NGS-based Prediction of miRNA Using Integrated Evidence |
title_short | miPIE: NGS-based Prediction of miRNA Using Integrated Evidence |
title_sort | mipie: ngs-based prediction of mirna using integrated evidence |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6367335/ https://www.ncbi.nlm.nih.gov/pubmed/30733467 http://dx.doi.org/10.1038/s41598-018-38107-z |
work_keys_str_mv | AT peacerj mipiengsbasedpredictionofmirnausingintegratedevidence AT sheikhhassanim mipiengsbasedpredictionofmirnausingintegratedevidence AT greenjr mipiengsbasedpredictionofmirnausingintegratedevidence |