Cargando…

TOGGLE: toolbox for generic NGS analyses

BACKGROUND: The explosion of NGS (Next Generation Sequencing) sequence data requires a huge effort in Bioinformatics methods and analyses. The creation of dedicated, robust and reliable pipelines able to handle dozens of samples from raw FASTQ data to relevant biological data is a time-consuming tas...

Descripción completa

Detalles Bibliográficos
Autores principales: Monat, Cécile, Tranchant-Dubreuil, Christine, Kougbeadjo, Ayité, Farcy, Cédric, Ortega-Abboud, Enrique, Amanzougarene, Souhila, Ravel, Sébastien, Agbessi, Mawussé, Orjuela-Bouniol, Julie, Summo, Maryline, Sabot, François
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4640241/
https://www.ncbi.nlm.nih.gov/pubmed/26552596
http://dx.doi.org/10.1186/s12859-015-0795-6
Descripción
Sumario:BACKGROUND: The explosion of NGS (Next Generation Sequencing) sequence data requires a huge effort in Bioinformatics methods and analyses. The creation of dedicated, robust and reliable pipelines able to handle dozens of samples from raw FASTQ data to relevant biological data is a time-consuming task in all projects relying on NGS. To address this, we created a generic and modular toolbox for developing such pipelines. RESULTS: TOGGLE (TOolbox for Generic nGs anaLysEs) is a suite of tools able to design pipelines that manage large sets of NGS softwares and utilities. Moreover, TOGGLE offers an easy way to manipulate the various options of the different softwares through the pipelines in using a single basic configuration file, which can be changed for each assay without having to change the code itself. We also describe one implementation of TOGGLE in a complete analysis pipeline designed for SNP discovery for large sets of genomic data, ready to use in different environments (from a single machine to HPC clusters). CONCLUSION: TOGGLE speeds up the creation of robust pipelines with reliable log tracking and data flow, for a large range of analyses. Moreover, it enables Biologists to concentrate on the biological relevance of results, and change the experimental conditions easily. The whole code and test data are available at https://github.com/SouthGreenPlatform/TOGGLE. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0795-6) contains supplementary material, which is available to authorized users.