Solving the Container Explosion Problem for Distributed High Throughput Computing
Container technologies are seeing wider use at advanced computing facilities for managing highly complex applications that must execute at multiple sites. However, in a distributed high throughput computing setting, the unrestricted use of containers can result in the container explosion problem. If...
Main authors: | Shaffer, Tim; Hazekamp, Nicholas; Blomer, Jakob; Thain, Douglas |
---|---|
Language: | eng |
Published: | 2020 |
Subjects: | Computing and Computers |
Online access: | https://dx.doi.org/10.1109/ipdps47924.2020.00048 http://cds.cern.ch/record/2799880 |
_version_ | 1780972589678592000 |
---|---|
author | Shaffer, Tim; Hazekamp, Nicholas; Blomer, Jakob; Thain, Douglas |
author_facet | Shaffer, Tim; Hazekamp, Nicholas; Blomer, Jakob; Thain, Douglas |
author_sort | Shaffer, Tim |
collection | CERN |
description | Container technologies are seeing wider use at advanced computing facilities for managing highly complex applications that must execute at multiple sites. However, in a distributed high throughput computing setting, the unrestricted use of containers can result in the container explosion problem. If a new container image is generated for each variation of a job dispatched to a site, shared storage is soon exceeded. On the other hand, if a single large container image is used to meet multiple needs, the size of that container may become a problem for storage and transport. To address this problem, we observe that many containers have an internal structure generated by a structured package manager, and this information could be used to strategically combine and share container images. We develop LANDLORD to exploit this property and evaluate its performance through a combination of simulation studies and empirical measurement of high energy physics applications. |
id | cern-2799880 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2020 |
record_format | invenio |
spelling | cern-2799880 2022-01-20T13:36:58Z doi:10.1109/ipdps47924.2020.00048 http://cds.cern.ch/record/2799880 eng Shaffer, Tim; Hazekamp, Nicholas; Blomer, Jakob; Thain, Douglas Solving the Container Explosion Problem for Distributed High Throughput Computing Computing and Computers Container technologies are seeing wider use at advanced computing facilities for managing highly complex applications that must execute at multiple sites. However, in a distributed high throughput computing setting, the unrestricted use of containers can result in the container explosion problem. If a new container image is generated for each variation of a job dispatched to a site, shared storage is soon exceeded. On the other hand, if a single large container image is used to meet multiple needs, the size of that container may become a problem for storage and transport. To address this problem, we observe that many containers have an internal structure generated by a structured package manager, and this information could be used to strategically combine and share container images. We develop LANDLORD to exploit this property and evaluate its performance through a combination of simulation studies and empirical measurement of high energy physics applications. oai:cds.cern.ch:2799880 2020 |
spellingShingle | Computing and Computers Shaffer, Tim; Hazekamp, Nicholas; Blomer, Jakob; Thain, Douglas Solving the Container Explosion Problem for Distributed High Throughput Computing |
title | Solving the Container Explosion Problem for Distributed High Throughput Computing |
title_full | Solving the Container Explosion Problem for Distributed High Throughput Computing |
title_fullStr | Solving the Container Explosion Problem for Distributed High Throughput Computing |
title_full_unstemmed | Solving the Container Explosion Problem for Distributed High Throughput Computing |
title_short | Solving the Container Explosion Problem for Distributed High Throughput Computing |
title_sort | solving the container explosion problem for distributed high throughput computing |
topic | Computing and Computers |
url | https://dx.doi.org/10.1109/ipdps47924.2020.00048 http://cds.cern.ch/record/2799880 |
work_keys_str_mv | AT shaffertim solvingthecontainerexplosionproblemfordistributedhighthroughputcomputing AT hazekampnicholas solvingthecontainerexplosionproblemfordistributedhighthroughputcomputing AT blomerjakob solvingthecontainerexplosionproblemfordistributedhighthroughputcomputing AT thaindouglas solvingthecontainerexplosionproblemfordistributedhighthroughputcomputing |