Cargando…

A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data

In eukaryotic cells, miRNAs regulate a plethora of cellular functionalities ranging from cellular metabolisms, and development to the regulation of biological networks and pathways, both under homeostatic and pathological states like cancer.Despite their immense importance as key regulators of cellu...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruhela, Vivek, Gupta, Anubha, Sriram, K., Ahuja, Gaurav, Kaur, Gurvinder, Gupta, Ritu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580950/
https://www.ncbi.nlm.nih.gov/pubmed/36304305
http://dx.doi.org/10.3389/fbinf.2022.842051
_version_ 1784812507549925376
author Ruhela, Vivek
Gupta, Anubha
Sriram, K.
Ahuja, Gaurav
Kaur, Gurvinder
Gupta, Ritu
author_facet Ruhela, Vivek
Gupta, Anubha
Sriram, K.
Ahuja, Gaurav
Kaur, Gurvinder
Gupta, Ritu
author_sort Ruhela, Vivek
collection PubMed
description In eukaryotic cells, miRNAs regulate a plethora of cellular functionalities ranging from cellular metabolisms, and development to the regulation of biological networks and pathways, both under homeostatic and pathological states like cancer.Despite their immense importance as key regulators of cellular processes, accurate and reliable estimation of miRNAs using Next Generation Sequencing is challenging, largely due to the limited availability of robust computational tools/methods/pipelines. Here, we introduce miRPipe, an end-to-end computational framework for the identification, characterization, and expression estimation of small RNAs, including the known and novel miRNAs and previously annotated pi-RNAs from small-RNA sequencing profiles. Our workflow detects unique novel miRNAs by incorporating the sequence information of seed and non-seed regions, concomitant with clustering analysis. This approach allows reliable and reproducible detection of unique novel miRNAs and functionally same miRNAs (paralogues). We validated the performance of miRPipe with the available state-of-the-art pipelines using both synthetic datasets generated using the newly developed miRSim tool and three cancer datasets (Chronic Lymphocytic Leukemia, Lung cancer, and breast cancer). In the experiment over the synthetic dataset, miRPipe is observed to outperform the existing state-of-the-art pipelines (accuracy: 95.23% and F (1)-score: 94.17%). Analysis on all the three cancer datasets shows that miRPipe is able to extract more number of known dysregulated miRNAs or piRNAs from the datasets as compared to the existing pipelines.
format Online
Article
Text
id pubmed-9580950
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95809502022-10-26 A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data Ruhela, Vivek Gupta, Anubha Sriram, K. Ahuja, Gaurav Kaur, Gurvinder Gupta, Ritu Front Bioinform Bioinformatics In eukaryotic cells, miRNAs regulate a plethora of cellular functionalities ranging from cellular metabolisms, and development to the regulation of biological networks and pathways, both under homeostatic and pathological states like cancer.Despite their immense importance as key regulators of cellular processes, accurate and reliable estimation of miRNAs using Next Generation Sequencing is challenging, largely due to the limited availability of robust computational tools/methods/pipelines. Here, we introduce miRPipe, an end-to-end computational framework for the identification, characterization, and expression estimation of small RNAs, including the known and novel miRNAs and previously annotated pi-RNAs from small-RNA sequencing profiles. Our workflow detects unique novel miRNAs by incorporating the sequence information of seed and non-seed regions, concomitant with clustering analysis. This approach allows reliable and reproducible detection of unique novel miRNAs and functionally same miRNAs (paralogues). We validated the performance of miRPipe with the available state-of-the-art pipelines using both synthetic datasets generated using the newly developed miRSim tool and three cancer datasets (Chronic Lymphocytic Leukemia, Lung cancer, and breast cancer). In the experiment over the synthetic dataset, miRPipe is observed to outperform the existing state-of-the-art pipelines (accuracy: 95.23% and F (1)-score: 94.17%). Analysis on all the three cancer datasets shows that miRPipe is able to extract more number of known dysregulated miRNAs or piRNAs from the datasets as compared to the existing pipelines. Frontiers Media S.A. 2022-07-08 /pmc/articles/PMC9580950/ /pubmed/36304305 http://dx.doi.org/10.3389/fbinf.2022.842051 Text en Copyright © 2022 Ruhela, Gupta, Sriram, Ahuja, Kaur and Gupta. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Ruhela, Vivek
Gupta, Anubha
Sriram, K.
Ahuja, Gaurav
Kaur, Gurvinder
Gupta, Ritu
A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data
title A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data
title_full A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data
title_fullStr A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data
title_full_unstemmed A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data
title_short A Unified Computational Framework for a Robust, Reliable, and Reproducible Identification of Novel miRNAs From the RNA Sequencing Data
title_sort unified computational framework for a robust, reliable, and reproducible identification of novel mirnas from the rna sequencing data
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580950/
https://www.ncbi.nlm.nih.gov/pubmed/36304305
http://dx.doi.org/10.3389/fbinf.2022.842051
work_keys_str_mv AT ruhelavivek aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT guptaanubha aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT sriramk aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT ahujagaurav aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT kaurgurvinder aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT guptaritu aunifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT ruhelavivek unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT guptaanubha unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT sriramk unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT ahujagaurav unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT kaurgurvinder unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata
AT guptaritu unifiedcomputationalframeworkforarobustreliableandreproducibleidentificationofnovelmirnasfromthernasequencingdata