Cargando…

Solving the Container Explosion Problem for Distributed High Throughput Computing

Container technologies are seeing wider use at advanced computing facilities for managing highly complex applications that must execute at multiple sites. However, in a distributed high throughput computing setting, the unrestricted use of containers can result in the container explosion problem. If...

Descripción completa

Detalles Bibliográficos
Autores principales: Shaffer, Tim, Hazekamp, Nicholas, Blomer, Jakob, Thain, Douglas
Lenguaje:eng
Publicado: 2020
Materias:
Acceso en línea:https://dx.doi.org/10.1109/ipdps47924.2020.00048
http://cds.cern.ch/record/2799880
_version_ 1780972589678592000
author Shaffer, Tim
Hazekamp, Nicholas
Blomer, Jakob
Thain, Douglas
author_facet Shaffer, Tim
Hazekamp, Nicholas
Blomer, Jakob
Thain, Douglas
author_sort Shaffer, Tim
collection CERN
description Container technologies are seeing wider use at advanced computing facilities for managing highly complex applications that must execute at multiple sites. However, in a distributed high throughput computing setting, the unrestricted use of containers can result in the container explosion problem. If a new container image is generated for each variation of a job dispatched to a site, shared storage is soon exceeded. On the other hand, if a single large container image is used to meet multiple needs, the size of that container may become a problem for storage and transport. To address this problem, we observe that many containers have an internal structure generated by a structured package manager, and this information could be used to strategically combine and share container images. We develop LANDLORD to exploit this property and evaluate its performance through a combination of simulation studies and empirical measurement of high energy physics applications.
id cern-2799880
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
record_format invenio
spelling cern-27998802022-01-20T13:36:58Zdoi:10.1109/ipdps47924.2020.00048http://cds.cern.ch/record/2799880engShaffer, TimHazekamp, NicholasBlomer, JakobThain, DouglasSolving the Container Explosion Problem for Distributed High Throughput ComputingComputing and ComputersContainer technologies are seeing wider use at advanced computing facilities for managing highly complex applications that must execute at multiple sites. However, in a distributed high throughput computing setting, the unrestricted use of containers can result in the container explosion problem. If a new container image is generated for each variation of a job dispatched to a site, shared storage is soon exceeded. On the other hand, if a single large container image is used to meet multiple needs, the size of that container may become a problem for storage and transport. To address this problem, we observe that many containers have an internal structure generated by a structured package manager, and this information could be used to strategically combine and share container images. We develop LANDLORD to exploit this property and evaluate its performance through a combination of simulation studies and empirical measurement of high energy physics applications.oai:cds.cern.ch:27998802020
spellingShingle Computing and Computers
Shaffer, Tim
Hazekamp, Nicholas
Blomer, Jakob
Thain, Douglas
Solving the Container Explosion Problem for Distributed High Throughput Computing
title Solving the Container Explosion Problem for Distributed High Throughput Computing
title_full Solving the Container Explosion Problem for Distributed High Throughput Computing
title_fullStr Solving the Container Explosion Problem for Distributed High Throughput Computing
title_full_unstemmed Solving the Container Explosion Problem for Distributed High Throughput Computing
title_short Solving the Container Explosion Problem for Distributed High Throughput Computing
title_sort solving the container explosion problem for distributed high throughput computing
topic Computing and Computers
url https://dx.doi.org/10.1109/ipdps47924.2020.00048
http://cds.cern.ch/record/2799880
work_keys_str_mv AT shaffertim solvingthecontainerexplosionproblemfordistributedhighthroughputcomputing
AT hazekampnicholas solvingthecontainerexplosionproblemfordistributedhighthroughputcomputing
AT blomerjakob solvingthecontainerexplosionproblemfordistributedhighthroughputcomputing
AT thaindouglas solvingthecontainerexplosionproblemfordistributedhighthroughputcomputing