aRNApipe: a balanced, efficient and distributed pipeline for processing RNA-seq data in high-performance computing environments

Bibliographic Details
Main authors: Alonso, Arnald; Lasseigne, Brittany N; Williams, Kelly; Nielsen, Josh; Ramaker, Ryne C; Hardigan, Andrew A; Johnston, Bobbi; Roberts, Brian S; Cooper, Sara J; Marsal, Sara; Myers, Richard M
Format: Online Article Text
Language: English
Published: Oxford University Press 2017
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5447234/
https://www.ncbi.nlm.nih.gov/pubmed/28108448
http://dx.doi.org/10.1093/bioinformatics/btx023
Description
Summary: The wide range of RNA-seq applications and their high computational needs require the development of pipelines that orchestrate the entire workflow and optimize the usage of available computational resources. We present aRNApipe, a project-oriented pipeline for processing RNA-seq data in high-performance cluster environments. aRNApipe is highly modular and can be easily migrated to any high-performance computing (HPC) environment. The current applications included in aRNApipe combine the essential RNA-seq primary analyses, including quality control metrics, transcript alignment, count generation, transcript fusion identification, alternative splicing and sequence variant calling. aRNApipe is project-oriented and dynamic, so users can easily update analyses to include or exclude samples or enable additional processing modules. Workflow parameters are easily set using a single configuration file that provides centralized tracking of all analytical processes. Finally, aRNApipe incorporates interactive web reports for sample tracking and a tool for managing the genome assemblies available to perform an analysis.
AVAILABILITY AND DOCUMENTATION: https://github.com/HudsonAlpha/aRNAPipe; DOI: 10.5281/zenodo.202950
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.