Cargando…

The impact of Docker containers on the performance of genomic pipelines

Genomic pipelines consist of several pieces of third party software and, because of their experimental nature, frequent changes and updates are commonly necessary thus raising serious deployment and reproducibility issues. Docker containers are emerging as a possible solution for many of these probl...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Tommaso, Paolo, Palumbo, Emilio, Chatzou, Maria, Prieto, Pablo, Heuer, Michael L., Notredame, Cedric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4586803/
https://www.ncbi.nlm.nih.gov/pubmed/26421241
http://dx.doi.org/10.7717/peerj.1273
_version_ 1782392437222670336
author Di Tommaso, Paolo
Palumbo, Emilio
Chatzou, Maria
Prieto, Pablo
Heuer, Michael L.
Notredame, Cedric
author_facet Di Tommaso, Paolo
Palumbo, Emilio
Chatzou, Maria
Prieto, Pablo
Heuer, Michael L.
Notredame, Cedric
author_sort Di Tommaso, Paolo
collection PubMed
description Genomic pipelines consist of several pieces of third party software and, because of their experimental nature, frequent changes and updates are commonly necessary thus raising serious deployment and reproducibility issues. Docker containers are emerging as a possible solution for many of these problems, as they allow the packaging of pipelines in an isolated and self-contained manner. This makes it easy to distribute and execute pipelines in a portable manner across a wide range of computing platforms. Thus, the question that arises is to what extent the use of Docker containers might affect the performance of these pipelines. Here we address this question and conclude that Docker containers have only a minor impact on the performance of common genomic pipelines, which is negligible when the executed jobs are long in terms of computational time.
format Online
Article
Text
id pubmed-4586803
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-45868032015-09-29 The impact of Docker containers on the performance of genomic pipelines Di Tommaso, Paolo Palumbo, Emilio Chatzou, Maria Prieto, Pablo Heuer, Michael L. Notredame, Cedric PeerJ Bioinformatics Genomic pipelines consist of several pieces of third party software and, because of their experimental nature, frequent changes and updates are commonly necessary thus raising serious deployment and reproducibility issues. Docker containers are emerging as a possible solution for many of these problems, as they allow the packaging of pipelines in an isolated and self-contained manner. This makes it easy to distribute and execute pipelines in a portable manner across a wide range of computing platforms. Thus, the question that arises is to what extent the use of Docker containers might affect the performance of these pipelines. Here we address this question and conclude that Docker containers have only a minor impact on the performance of common genomic pipelines, which is negligible when the executed jobs are long in terms of computational time. PeerJ Inc. 2015-09-24 /pmc/articles/PMC4586803/ /pubmed/26421241 http://dx.doi.org/10.7717/peerj.1273 Text en © 2015 Di Tommaso et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Di Tommaso, Paolo
Palumbo, Emilio
Chatzou, Maria
Prieto, Pablo
Heuer, Michael L.
Notredame, Cedric
The impact of Docker containers on the performance of genomic pipelines
title The impact of Docker containers on the performance of genomic pipelines
title_full The impact of Docker containers on the performance of genomic pipelines
title_fullStr The impact of Docker containers on the performance of genomic pipelines
title_full_unstemmed The impact of Docker containers on the performance of genomic pipelines
title_short The impact of Docker containers on the performance of genomic pipelines
title_sort impact of docker containers on the performance of genomic pipelines
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4586803/
https://www.ncbi.nlm.nih.gov/pubmed/26421241
http://dx.doi.org/10.7717/peerj.1273
work_keys_str_mv AT ditommasopaolo theimpactofdockercontainersontheperformanceofgenomicpipelines
AT palumboemilio theimpactofdockercontainersontheperformanceofgenomicpipelines
AT chatzoumaria theimpactofdockercontainersontheperformanceofgenomicpipelines
AT prietopablo theimpactofdockercontainersontheperformanceofgenomicpipelines
AT heuermichaell theimpactofdockercontainersontheperformanceofgenomicpipelines
AT notredamecedric theimpactofdockercontainersontheperformanceofgenomicpipelines
AT ditommasopaolo impactofdockercontainersontheperformanceofgenomicpipelines
AT palumboemilio impactofdockercontainersontheperformanceofgenomicpipelines
AT chatzoumaria impactofdockercontainersontheperformanceofgenomicpipelines
AT prietopablo impactofdockercontainersontheperformanceofgenomicpipelines
AT heuermichaell impactofdockercontainersontheperformanceofgenomicpipelines
AT notredamecedric impactofdockercontainersontheperformanceofgenomicpipelines