Cargando…

The impact of Docker containers on the performance of genomic pipelines

Genomic pipelines consist of several pieces of third party software and, because of their experimental nature, frequent changes and updates are commonly necessary thus raising serious deployment and reproducibility issues. Docker containers are emerging as a possible solution for many of these probl...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Tommaso, Paolo, Palumbo, Emilio, Chatzou, Maria, Prieto, Pablo, Heuer, Michael L., Notredame, Cedric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4586803/
https://www.ncbi.nlm.nih.gov/pubmed/26421241
http://dx.doi.org/10.7717/peerj.1273
Descripción
Sumario:Genomic pipelines consist of several pieces of third party software and, because of their experimental nature, frequent changes and updates are commonly necessary thus raising serious deployment and reproducibility issues. Docker containers are emerging as a possible solution for many of these problems, as they allow the packaging of pipelines in an isolated and self-contained manner. This makes it easy to distribute and execute pipelines in a portable manner across a wide range of computing platforms. Thus, the question that arises is to what extent the use of Docker containers might affect the performance of these pipelines. Here we address this question and conclude that Docker containers have only a minor impact on the performance of common genomic pipelines, which is negligible when the executed jobs are long in terms of computational time.