Cargando…

miR-BAG: Bagging Based Identification of MicroRNA Precursors

Non-coding elements such as miRNAs play key regulatory roles in living systems. These ultra-short, ∼21 bp long, RNA molecules are derived from their hairpin precursors and usually participate in negative gene regulation by binding the target mRNAs. Discovering miRNA candidate regions across the geno...

Descripción completa

Detalles Bibliográficos
Autores principales: Jha, Ashwani, Chauhan, Rohit, Mehra, Mrigaya, Singh, Heikham Russiachand, Shankar, Ravi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3458082/
https://www.ncbi.nlm.nih.gov/pubmed/23049860
http://dx.doi.org/10.1371/journal.pone.0045782
_version_ 1782244623728508928
author Jha, Ashwani
Chauhan, Rohit
Mehra, Mrigaya
Singh, Heikham Russiachand
Shankar, Ravi
author_facet Jha, Ashwani
Chauhan, Rohit
Mehra, Mrigaya
Singh, Heikham Russiachand
Shankar, Ravi
author_sort Jha, Ashwani
collection PubMed
description Non-coding elements such as miRNAs play key regulatory roles in living systems. These ultra-short, ∼21 bp long, RNA molecules are derived from their hairpin precursors and usually participate in negative gene regulation by binding the target mRNAs. Discovering miRNA candidate regions across the genome has been a challenging problem. Most of the existing tools work reliably only for limited datasets. Here, we have presented a novel reliable approach, miR-BAG, developed to identify miRNA candidate regions in genomes by scanning sequences as well as by using next generation sequencing (NGS) data. miR-BAG utilizes a bootstrap aggregation based machine learning approach, successfully creating an ensemble of complementary learners to attain high accuracy while balancing sensitivity and specificity. miR-BAG was developed for wide range of species and tested extensively for performance over a wide range of experimentally validated data. Consideration of position-specific variation of triplet structural profiles and mature miRNA anchored structural profiles had a positive impact on performance. miR-BAG’s performance was found consistent and the accuracy level was observed to be >90% for most of the species considered in the present study. In a detailed comparative analysis, miR-BAG performed better than six existing tools. Using miR-BAG NGS module, we identified a total of 22 novel miRNA candidate regions in cow genome in addition to a total of 42 cow specific miRNA regions. In practice, discovery of miRNA regions in a genome demands high-throughput data analysis, requiring large amount of processing. Considering this, miR-BAG has been developed in multi-threaded parallel architecture as a web server as well as a user friendly GUI standalone version.
format Online
Article
Text
id pubmed-3458082
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34580822012-10-03 miR-BAG: Bagging Based Identification of MicroRNA Precursors Jha, Ashwani Chauhan, Rohit Mehra, Mrigaya Singh, Heikham Russiachand Shankar, Ravi PLoS One Research Article Non-coding elements such as miRNAs play key regulatory roles in living systems. These ultra-short, ∼21 bp long, RNA molecules are derived from their hairpin precursors and usually participate in negative gene regulation by binding the target mRNAs. Discovering miRNA candidate regions across the genome has been a challenging problem. Most of the existing tools work reliably only for limited datasets. Here, we have presented a novel reliable approach, miR-BAG, developed to identify miRNA candidate regions in genomes by scanning sequences as well as by using next generation sequencing (NGS) data. miR-BAG utilizes a bootstrap aggregation based machine learning approach, successfully creating an ensemble of complementary learners to attain high accuracy while balancing sensitivity and specificity. miR-BAG was developed for wide range of species and tested extensively for performance over a wide range of experimentally validated data. Consideration of position-specific variation of triplet structural profiles and mature miRNA anchored structural profiles had a positive impact on performance. miR-BAG’s performance was found consistent and the accuracy level was observed to be >90% for most of the species considered in the present study. In a detailed comparative analysis, miR-BAG performed better than six existing tools. Using miR-BAG NGS module, we identified a total of 22 novel miRNA candidate regions in cow genome in addition to a total of 42 cow specific miRNA regions. In practice, discovery of miRNA regions in a genome demands high-throughput data analysis, requiring large amount of processing. Considering this, miR-BAG has been developed in multi-threaded parallel architecture as a web server as well as a user friendly GUI standalone version. Public Library of Science 2012-09-25 /pmc/articles/PMC3458082/ /pubmed/23049860 http://dx.doi.org/10.1371/journal.pone.0045782 Text en © 2012 Jha et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Jha, Ashwani
Chauhan, Rohit
Mehra, Mrigaya
Singh, Heikham Russiachand
Shankar, Ravi
miR-BAG: Bagging Based Identification of MicroRNA Precursors
title miR-BAG: Bagging Based Identification of MicroRNA Precursors
title_full miR-BAG: Bagging Based Identification of MicroRNA Precursors
title_fullStr miR-BAG: Bagging Based Identification of MicroRNA Precursors
title_full_unstemmed miR-BAG: Bagging Based Identification of MicroRNA Precursors
title_short miR-BAG: Bagging Based Identification of MicroRNA Precursors
title_sort mir-bag: bagging based identification of microrna precursors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3458082/
https://www.ncbi.nlm.nih.gov/pubmed/23049860
http://dx.doi.org/10.1371/journal.pone.0045782
work_keys_str_mv AT jhaashwani mirbagbaggingbasedidentificationofmicrornaprecursors
AT chauhanrohit mirbagbaggingbasedidentificationofmicrornaprecursors
AT mehramrigaya mirbagbaggingbasedidentificationofmicrornaprecursors
AT singhheikhamrussiachand mirbagbaggingbasedidentificationofmicrornaprecursors
AT shankarravi mirbagbaggingbasedidentificationofmicrornaprecursors