Cargando…

SnakeCube: containerized and automated pipeline for de novo genome assembly in HPC environments

OBJECTIVE: The rapid progress in sequencing technology and related bioinformatics tools aims at disentangling diversity and conservation issues through genome analyses. The foremost challenges of the field involve coping with questions emerging from the swift development and application of new algor...

Descripción completa

Detalles Bibliográficos
Autores principales: Angelova, Nelina, Danis, Theodoros, Lagnel, Jacques, Tsigenopoulos, Costas S., Manousaki, Tereza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8900408/
https://www.ncbi.nlm.nih.gov/pubmed/35255960
http://dx.doi.org/10.1186/s13104-022-05978-5
Descripción
Sumario:OBJECTIVE: The rapid progress in sequencing technology and related bioinformatics tools aims at disentangling diversity and conservation issues through genome analyses. The foremost challenges of the field involve coping with questions emerging from the swift development and application of new algorithms, as well as the establishment of standardized analysis approaches that promote transparency and transferability in research. RESULTS: Here, we present SnakeCube, an automated and containerized whole de novo genome assembly pipeline that runs within isolated, secured environments and scales for use in High Performance Computing (HPC) domains. SnakeCube was optimized for its performance and tested for its effectiveness with various inputs, highlighting its safe and robust universal use in the field. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13104-022-05978-5.