Cargando…
NGSANE: a lightweight production informatics framework for high-throughput data analysis
Summary: The initial steps in the analysis of next-generation sequencing data can be automated by way of software ‘pipelines’. However, individual components depreciate rapidly because of the evolving technology and analysis methods, often rendering entire versions of production informatics pipeline...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4016703/ https://www.ncbi.nlm.nih.gov/pubmed/24470576 http://dx.doi.org/10.1093/bioinformatics/btu036 |
_version_ | 1782315552908247040 |
---|---|
author | Buske, Fabian A. French, Hugh J. Smith, Martin A. Clark, Susan J. Bauer, Denis C. |
author_facet | Buske, Fabian A. French, Hugh J. Smith, Martin A. Clark, Susan J. Bauer, Denis C. |
author_sort | Buske, Fabian A. |
collection | PubMed |
description | Summary: The initial steps in the analysis of next-generation sequencing data can be automated by way of software ‘pipelines’. However, individual components depreciate rapidly because of the evolving technology and analysis methods, often rendering entire versions of production informatics pipelines obsolete. Constructing pipelines from Linux bash commands enables the use of hot swappable modular components as opposed to the more rigid program call wrapping by higher level languages, as implemented in comparable published pipelining systems. Here we present Next Generation Sequencing ANalysis for Enterprises (NGSANE), a Linux-based, high-performance-computing-enabled framework that minimizes overhead for set up and processing of new projects, yet maintains full flexibility of custom scripting when processing raw sequence data. Availability and implementation: Ngsane is implemented in bash and publicly available under BSD (3-Clause) licence via GitHub at https://github.com/BauerLab/ngsane. Contact: Denis.Bauer@csiro.au Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4016703 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-40167032014-05-12 NGSANE: a lightweight production informatics framework for high-throughput data analysis Buske, Fabian A. French, Hugh J. Smith, Martin A. Clark, Susan J. Bauer, Denis C. Bioinformatics Applications Notes Summary: The initial steps in the analysis of next-generation sequencing data can be automated by way of software ‘pipelines’. However, individual components depreciate rapidly because of the evolving technology and analysis methods, often rendering entire versions of production informatics pipelines obsolete. Constructing pipelines from Linux bash commands enables the use of hot swappable modular components as opposed to the more rigid program call wrapping by higher level languages, as implemented in comparable published pipelining systems. Here we present Next Generation Sequencing ANalysis for Enterprises (NGSANE), a Linux-based, high-performance-computing-enabled framework that minimizes overhead for set up and processing of new projects, yet maintains full flexibility of custom scripting when processing raw sequence data. Availability and implementation: Ngsane is implemented in bash and publicly available under BSD (3-Clause) licence via GitHub at https://github.com/BauerLab/ngsane. Contact: Denis.Bauer@csiro.au Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-05-15 2014-01-26 /pmc/articles/PMC4016703/ /pubmed/24470576 http://dx.doi.org/10.1093/bioinformatics/btu036 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Applications Notes Buske, Fabian A. French, Hugh J. Smith, Martin A. Clark, Susan J. Bauer, Denis C. NGSANE: a lightweight production informatics framework for high-throughput data analysis |
title | NGSANE: a lightweight production informatics framework for high-throughput data analysis |
title_full | NGSANE: a lightweight production informatics framework for high-throughput data analysis |
title_fullStr | NGSANE: a lightweight production informatics framework for high-throughput data analysis |
title_full_unstemmed | NGSANE: a lightweight production informatics framework for high-throughput data analysis |
title_short | NGSANE: a lightweight production informatics framework for high-throughput data analysis |
title_sort | ngsane: a lightweight production informatics framework for high-throughput data analysis |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4016703/ https://www.ncbi.nlm.nih.gov/pubmed/24470576 http://dx.doi.org/10.1093/bioinformatics/btu036 |
work_keys_str_mv | AT buskefabiana ngsanealightweightproductioninformaticsframeworkforhighthroughputdataanalysis AT frenchhughj ngsanealightweightproductioninformaticsframeworkforhighthroughputdataanalysis AT smithmartina ngsanealightweightproductioninformaticsframeworkforhighthroughputdataanalysis AT clarksusanj ngsanealightweightproductioninformaticsframeworkforhighthroughputdataanalysis AT bauerdenisc ngsanealightweightproductioninformaticsframeworkforhighthroughputdataanalysis |