Cargando…
A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data
In eukaryotic cells, miRNAs regulate a plethora of cellular functionalities ranging from cellular metabolisms, and development to the regulation of biological networks and pathways, both under homeostatic and pathological states like cancer.Despite their immense importance as key regulators of cellu...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580950/ https://www.ncbi.nlm.nih.gov/pubmed/36304305 http://dx.doi.org/10.3389/fbinf.2022.842051 |
_version_ | 1784812507549925376 |
---|---|
author | Ruhela, Vivek Gupta, Anubha Sriram, K. Ahuja, Gaurav Kaur, Gurvinder Gupta, Ritu |
author_facet | Ruhela, Vivek Gupta, Anubha Sriram, K. Ahuja, Gaurav Kaur, Gurvinder Gupta, Ritu |
author_sort | Ruhela, Vivek |
collection | PubMed |
description | In eukaryotic cells, miRNAs regulate a plethora of cellular functionalities ranging from cellular metabolisms, and development to the regulation of biological networks and pathways, both under homeostatic and pathological states like cancer.Despite their immense importance as key regulators of cellular processes, accurate and reliable estimation of miRNAs using Next Generation Sequencing is challenging, largely due to the limited availability of robust computational tools/methods/pipelines. Here, we introduce miRPipe, an end-to-end computational framework for the identification, characterization, and expression estimation of small RNAs, including the known and novel miRNAs and previously annotated pi-RNAs from small-RNA sequencing profiles. Our workflow detects unique novel miRNAs by incorporating the sequence information of seed and non-seed regions, concomitant with clustering analysis. This approach allows reliable and reproducible detection of unique novel miRNAs and functionally same miRNAs (paralogues). We validated the performance of miRPipe with the available state-of-the-art pipelines using both synthetic datasets generated using the newly developed miRSim tool and three cancer datasets (Chronic Lymphocytic Leukemia, Lung cancer, and breast cancer). In the experiment over the synthetic dataset, miRPipe is observed to outperform the existing state-of-the-art pipelines (accuracy: 95.23% and F (1)-score: 94.17%). Analysis on all the three cancer datasets shows that miRPipe is able to extract more number of known dysregulated miRNAs or piRNAs from the datasets as compared to the existing pipelines. |
format | Online Article Text |
id | pubmed-9580950 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95809502022-10-26 A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data Ruhela, Vivek Gupta, Anubha Sriram, K. Ahuja, Gaurav Kaur, Gurvinder Gupta, Ritu Front Bioinform Bioinformatics In eukaryotic cells, miRNAs regulate a plethora of cellular functionalities ranging from cellular metabolisms, and development to the regulation of biological networks and pathways, both under homeostatic and pathological states like cancer.Despite their immense importance as key regulators of cellular processes, accurate and reliable estimation of miRNAs using Next Generation Sequencing is challenging, largely due to the limited availability of robust computational tools/methods/pipelines. Here, we introduce miRPipe, an end-to-end computational framework for the identification, characterization, and expression estimation of small RNAs, including the known and novel miRNAs and previously annotated pi-RNAs from small-RNA sequencing profiles. Our workflow detects unique novel miRNAs by incorporating the sequence information of seed and non-seed regions, concomitant with clustering analysis. This approach allows reliable and reproducible detection of unique novel miRNAs and functionally same miRNAs (paralogues). We validated the performance of miRPipe with the available state-of-the-art pipelines using both synthetic datasets generated using the newly developed miRSim tool and three cancer datasets (Chronic Lymphocytic Leukemia, Lung cancer, and breast cancer). In the experiment over the synthetic dataset, miRPipe is observed to outperform the existing state-of-the-art pipelines (accuracy: 95.23% and F (1)-score: 94.17%). Analysis on all the three cancer datasets shows that miRPipe is able to extract more number of known dysregulated miRNAs or piRNAs from the datasets as compared to the existing pipelines. Frontiers Media S.A. 2022-07-08 /pmc/articles/PMC9580950/ /pubmed/36304305 http://dx.doi.org/10.3389/fbinf.2022.842051 Text en Copyright © 2022 Ruhela, Gupta, Sriram, Ahuja, Kaur and Gupta. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioinformatics Ruhela, Vivek Gupta, Anubha Sriram, K. Ahuja, Gaurav Kaur, Gurvinder Gupta, Ritu A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data |
title | A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data |
title_full | A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data |
title_fullStr | A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data |
title_full_unstemmed | A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data |
title_short | A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data |
title_sort | unified computational framework for a robust, reliable, and reproducible identification of novel mirnas from the rna sequencing data |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580950/ https://www.ncbi.nlm.nih.gov/pubmed/36304305 http://dx.doi.org/10.3389/fbinf.2022.842051 |
work_keys_str_mv | AT ruhelavivek aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT guptaanubha aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT sriramk aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT ahujagaurav aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT kaurgurvinder aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT guptaritu aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT ruhelavivek unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT guptaanubha unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT sriramk unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT ahujagaurav unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT kaurgurvinder unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata AT guptaritu unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata |