Cargando…

CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data

BACKGROUND: miRNAs play a key role in normal physiology and various diseases. miRNA profiling through next generation sequencing (miRNA-seq) has become the main platform for biological research and biomarker discovery. However, analyzing miRNA sequencing data is challenging as it needs significant a...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Zhifu, Evans, Jared, Bhagwate, Aditya, Middha, Sumit, Bockol, Matthew, Yan, Huihuang, Kocher, Jean-Pierre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4070549/
https://www.ncbi.nlm.nih.gov/pubmed/24894665
http://dx.doi.org/10.1186/1471-2164-15-423
_version_ 1782322706233950208
author Sun, Zhifu
Evans, Jared
Bhagwate, Aditya
Middha, Sumit
Bockol, Matthew
Yan, Huihuang
Kocher, Jean-Pierre
author_facet Sun, Zhifu
Evans, Jared
Bhagwate, Aditya
Middha, Sumit
Bockol, Matthew
Yan, Huihuang
Kocher, Jean-Pierre
author_sort Sun, Zhifu
collection PubMed
description BACKGROUND: miRNAs play a key role in normal physiology and various diseases. miRNA profiling through next generation sequencing (miRNA-seq) has become the main platform for biological research and biomarker discovery. However, analyzing miRNA sequencing data is challenging as it needs significant amount of computational resources and bioinformatics expertise. Several web based analytical tools have been developed but they are limited to processing one or a pair of samples at time and are not suitable for a large scale study. Lack of flexibility and reliability of these web applications are also common issues. RESULTS: We developed a Comprehensive Analysis Pipeline for microRNA Sequencing data (CAP-miRSeq) that integrates read pre-processing, alignment, mature/precursor/novel miRNA detection and quantification, data visualization, variant detection in miRNA coding region, and more flexible differential expression analysis between experimental conditions. According to computational infrastructure, users can install the package locally or deploy it in Amazon Cloud to run samples sequentially or in parallel for a large number of samples for speedy analyses. In either case, summary and expression reports for all samples are generated for easier quality assessment and downstream analyses. Using well characterized data, we demonstrated the pipeline’s superior performances, flexibility, and practical use in research and biomarker discovery. CONCLUSIONS: CAP-miRSeq is a powerful and flexible tool for users to process and analyze miRNA-seq data scalable from a few to hundreds of samples. The results are presented in the convenient way for investigators or analysts to conduct further investigation and discovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-423) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4070549
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40705492014-06-27 CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data Sun, Zhifu Evans, Jared Bhagwate, Aditya Middha, Sumit Bockol, Matthew Yan, Huihuang Kocher, Jean-Pierre BMC Genomics Software BACKGROUND: miRNAs play a key role in normal physiology and various diseases. miRNA profiling through next generation sequencing (miRNA-seq) has become the main platform for biological research and biomarker discovery. However, analyzing miRNA sequencing data is challenging as it needs significant amount of computational resources and bioinformatics expertise. Several web based analytical tools have been developed but they are limited to processing one or a pair of samples at time and are not suitable for a large scale study. Lack of flexibility and reliability of these web applications are also common issues. RESULTS: We developed a Comprehensive Analysis Pipeline for microRNA Sequencing data (CAP-miRSeq) that integrates read pre-processing, alignment, mature/precursor/novel miRNA detection and quantification, data visualization, variant detection in miRNA coding region, and more flexible differential expression analysis between experimental conditions. According to computational infrastructure, users can install the package locally or deploy it in Amazon Cloud to run samples sequentially or in parallel for a large number of samples for speedy analyses. In either case, summary and expression reports for all samples are generated for easier quality assessment and downstream analyses. Using well characterized data, we demonstrated the pipeline’s superior performances, flexibility, and practical use in research and biomarker discovery. CONCLUSIONS: CAP-miRSeq is a powerful and flexible tool for users to process and analyze miRNA-seq data scalable from a few to hundreds of samples. The results are presented in the convenient way for investigators or analysts to conduct further investigation and discovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-423) contains supplementary material, which is available to authorized users. BioMed Central 2014-06-03 /pmc/articles/PMC4070549/ /pubmed/24894665 http://dx.doi.org/10.1186/1471-2164-15-423 Text en © Sun et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Sun, Zhifu
Evans, Jared
Bhagwate, Aditya
Middha, Sumit
Bockol, Matthew
Yan, Huihuang
Kocher, Jean-Pierre
CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data
title CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data
title_full CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data
title_fullStr CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data
title_full_unstemmed CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data
title_short CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data
title_sort cap-mirseq: a comprehensive analysis pipeline for microrna sequencing data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4070549/
https://www.ncbi.nlm.nih.gov/pubmed/24894665
http://dx.doi.org/10.1186/1471-2164-15-423
work_keys_str_mv AT sunzhifu capmirseqacomprehensiveanalysispipelineformicrornasequencingdata
AT evansjared capmirseqacomprehensiveanalysispipelineformicrornasequencingdata
AT bhagwateaditya capmirseqacomprehensiveanalysispipelineformicrornasequencingdata
AT middhasumit capmirseqacomprehensiveanalysispipelineformicrornasequencingdata
AT bockolmatthew capmirseqacomprehensiveanalysispipelineformicrornasequencingdata
AT yanhuihuang capmirseqacomprehensiveanalysispipelineformicrornasequencingdata
AT kocherjeanpierre capmirseqacomprehensiveanalysispipelineformicrornasequencingdata