Cargando…

systemPipeR: NGS workflow and report generation environment

BACKGROUND: Next-generation sequencing (NGS) has revolutionized how research is carried out in many areas of biology and medicine. However, the analysis of NGS data remains a major obstacle to the efficient utilization of the technology, as it requires complex multi-step processing of big data deman...

Descripción completa

Detalles Bibliográficos
Autores principales: H. Backman, Tyler W., Girke, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5029110/
https://www.ncbi.nlm.nih.gov/pubmed/27650223
http://dx.doi.org/10.1186/s12859-016-1241-0
_version_ 1782454465291354112
author H. Backman, Tyler W.
Girke, Thomas
author_facet H. Backman, Tyler W.
Girke, Thomas
author_sort H. Backman, Tyler W.
collection PubMed
description BACKGROUND: Next-generation sequencing (NGS) has revolutionized how research is carried out in many areas of biology and medicine. However, the analysis of NGS data remains a major obstacle to the efficient utilization of the technology, as it requires complex multi-step processing of big data demanding considerable computational expertise from users. While substantial effort has been invested on the development of software dedicated to the individual analysis steps of NGS experiments, insufficient resources are currently available for integrating the individual software components within the widely used R/Bioconductor environment into automated workflows capable of running the analysis of most types of NGS applications from start-to-finish in a time-efficient and reproducible manner. RESULTS: To address this need, we have developed the R/Bioconductor package systemPipeR. It is an extensible environment for both building and running end-to-end analysis workflows with automated report generation for a wide range of NGS applications. Its unique features include a uniform workflow interface across different NGS applications, automated report generation, and support for running both R and command-line software on local computers and computer clusters. A flexible sample annotation infrastructure efficiently handles complex sample sets and experimental designs. To simplify the analysis of widely used NGS applications, the package provides pre-configured workflows and reporting templates for RNA-Seq, ChIP-Seq, VAR-Seq and Ribo-Seq. Additional workflow templates will be provided in the future. CONCLUSIONS: systemPipeR accelerates the extraction of reproducible analysis results from NGS experiments. By combining the capabilities of many R/Bioconductor and command-line tools, it makes efficient use of existing software resources without limiting the user to a set of predefined methods or environments. systemPipeR is freely available for all common operating systems from Bioconductor (http://bioconductor.org/packages/devel/systemPipeR). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1241-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5029110
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50291102016-09-27 systemPipeR: NGS workflow and report generation environment H. Backman, Tyler W. Girke, Thomas BMC Bioinformatics Software BACKGROUND: Next-generation sequencing (NGS) has revolutionized how research is carried out in many areas of biology and medicine. However, the analysis of NGS data remains a major obstacle to the efficient utilization of the technology, as it requires complex multi-step processing of big data demanding considerable computational expertise from users. While substantial effort has been invested on the development of software dedicated to the individual analysis steps of NGS experiments, insufficient resources are currently available for integrating the individual software components within the widely used R/Bioconductor environment into automated workflows capable of running the analysis of most types of NGS applications from start-to-finish in a time-efficient and reproducible manner. RESULTS: To address this need, we have developed the R/Bioconductor package systemPipeR. It is an extensible environment for both building and running end-to-end analysis workflows with automated report generation for a wide range of NGS applications. Its unique features include a uniform workflow interface across different NGS applications, automated report generation, and support for running both R and command-line software on local computers and computer clusters. A flexible sample annotation infrastructure efficiently handles complex sample sets and experimental designs. To simplify the analysis of widely used NGS applications, the package provides pre-configured workflows and reporting templates for RNA-Seq, ChIP-Seq, VAR-Seq and Ribo-Seq. Additional workflow templates will be provided in the future. CONCLUSIONS: systemPipeR accelerates the extraction of reproducible analysis results from NGS experiments. By combining the capabilities of many R/Bioconductor and command-line tools, it makes efficient use of existing software resources without limiting the user to a set of predefined methods or environments. systemPipeR is freely available for all common operating systems from Bioconductor (http://bioconductor.org/packages/devel/systemPipeR). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1241-0) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-20 /pmc/articles/PMC5029110/ /pubmed/27650223 http://dx.doi.org/10.1186/s12859-016-1241-0 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
H. Backman, Tyler W.
Girke, Thomas
systemPipeR: NGS workflow and report generation environment
title systemPipeR: NGS workflow and report generation environment
title_full systemPipeR: NGS workflow and report generation environment
title_fullStr systemPipeR: NGS workflow and report generation environment
title_full_unstemmed systemPipeR: NGS workflow and report generation environment
title_short systemPipeR: NGS workflow and report generation environment
title_sort systempiper: ngs workflow and report generation environment
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5029110/
https://www.ncbi.nlm.nih.gov/pubmed/27650223
http://dx.doi.org/10.1186/s12859-016-1241-0
work_keys_str_mv AT hbackmantylerw systempiperngsworkflowandreportgenerationenvironment
AT girkethomas systempiperngsworkflowandreportgenerationenvironment