ARC-CE+HyperQueue based submission system of ATLAS jobs for the Karolina HPC

For several years, the distributed computing of the ATLAS experiment at the LHC has been granted opportunistic use of computing resources of the Czech national HPC centre, IT4Innovations. With the introduction of Karolina HPC, resources provided to ATLAS significantly increased, but with lower effic...

Bibliographic Details
Main authors: Svatos, Michal; Chudoba, Jiri; Vokac, Petr
Language: eng
Published: 2022
Subjects:
Online access: http://cds.cern.ch/record/2837871
_version_ 1780975895239983104
author Svatos, Michal
Chudoba, Jiri
Vokac, Petr
collection CERN
description For several years, the distributed computing of the ATLAS experiment at the LHC has been granted opportunistic use of computing resources of the Czech national HPC centre, IT4Innovations. With the introduction of Karolina HPC, resources provided to ATLAS significantly increased, but with lower efficiency. The inefficiency arose because ATLAS jobs, designed for 8 cores, have a rather short multiprocessing phase on the 128 cores of Karolina's worker nodes in comparison with initialization and finalization running on a single core. To ensure efficient usage, HyperQueue was implemented in the ARC-CE based submission system. This enables four 32-core jobs to be sent to each worker node, significantly improving CPU efficiency without leaving empty resources.
id cern-2837871
institution European Organization for Nuclear Research
language eng
publishDate 2022
record_format invenio
spelling cern-2837871; 2022-10-20T21:05:08Z; http://cds.cern.ch/record/2837871; eng; Svatos, Michal; Chudoba, Jiri; Vokac, Petr; ARC-CE+HyperQueue based submission system of ATLAS jobs for the Karolina HPC; Particle Physics - Experiment; ATL-SOFT-SLIDE-2022-550; oai:cds.cern.ch:2837871; 2022-10-20
title ARC-CE+HyperQueue based submission system of ATLAS jobs for the Karolina HPC
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2837871
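
The description's efficiency argument (single-core initialization and finalization around a short multiprocessing phase, and four 32-core jobs filling a 128-core node) can be illustrated with a small Python sketch. This is a toy model, not the authors' method: the function names, the assumption of perfect scaling in the multiprocessing phase, and all timings are hypothetical and chosen only for illustration.

```python
def jobs_per_node(node_cores: int, job_cores: int) -> tuple[int, int]:
    """Return (jobs that fit on one node, cores left idle)."""
    return divmod(node_cores, job_cores)


def cpu_efficiency(serial_s: float, parallel_core_s: float, cores: int) -> float:
    """CPU time used divided by CPU time reserved for one job.

    serial_s: wall time of single-core initialization plus finalization.
    parallel_core_s: total work of the multiprocessing phase in core-seconds,
        assumed (hypothetically) to scale perfectly across the allocated cores.
    """
    wall_s = serial_s + parallel_core_s / cores
    used_core_s = serial_s + parallel_core_s
    return used_core_s / (cores * wall_s)


if __name__ == "__main__":
    # Hypothetical timings, not measurements from the talk.
    serial_s = 1800.0
    parallel_core_s = 8 * 6 * 3600.0

    # Four 32-core jobs fill a 128-core node with no idle cores.
    print(jobs_per_node(128, 32))

    # Compare one job holding the whole node with one 32-core slot.
    for cores in (128, 32):
        print(cores, round(cpu_efficiency(serial_s, parallel_core_s, cores), 3))
```

In this model the serial phases occupy every reserved core while using only one, so a narrower allocation wastes less of the reservation. Real jobs will deviate from the perfect-scaling assumption.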