
Harnessing the power of supercomputers using the PanDA Pilot 2 in the ATLAS Experiment

Bibliographic Details
Main authors: Nilsson, Paul; Benjamin, Douglas; Oleynik, Danila; Anisenkov, Alexey; Guan, Wen; Javurek, Tomas
Language: eng
Published: 2019
Subjects: Particle Physics - Experiment
Online access: http://cds.cern.ch/record/2697376
_version_ 1780964212604928000
author Nilsson, Paul
Benjamin, Douglas
Oleynik, Danila
Anisenkov, Alexey
Guan, Wen
Javurek, Tomas
author_sort Nilsson, Paul
collection CERN
description The unprecedented computing resource needs of the ATLAS experiment have motivated the Collaboration to become a leader in exploiting High Performance Computers (HPCs). To meet the requirements of HPCs, the PanDA system has been equipped with two new components, Pilot 2 and Harvester, that were designed with HPCs in mind. While Harvester is a resource-facing service that provides resource provisioning and workload shaping, Pilot 2 is responsible for payload execution on the resource. The presentation focuses on Pilot 2, a complete rewrite of the original PanDA Pilot used by ATLAS and other experiments for well over a decade. Pilot 2 has a flexible and adaptive design that allows plugins to be defined with streamlined workflows. In particular, it has plugins for specific hardware infrastructures (HPC/GPU clusters) as well as for dedicated workflows defined by the needs of an experiment. Two examples of dedicated HPC workflows are discussed: one in which the Pilot acts like an MPI application that runs a set of jobs in an assembly, and one that uses the Yoda-Droid tools in the ATLAS Event Service mode under the control of the Harvester service. In addition to the technical details of these workflows, results are shown from deployments on Cori (NERSC), Theta (ALCF), Titan and Summit (OLCF).
id cern-2697376
institution European Organization for Nuclear Research
language eng
publishDate 2019
record_format invenio
spelling cern-2697376; 2019-10-31T21:08:22Z; http://cds.cern.ch/record/2697376; eng; Nilsson, Paul; Benjamin, Douglas; Oleynik, Danila; Anisenkov, Alexey; Guan, Wen; Javurek, Tomas; Harnessing the power of supercomputers using the PanDA Pilot 2 in the ATLAS Experiment; Particle Physics - Experiment; ATL-SOFT-SLIDE-2019-821; oai:cds.cern.ch:2697376; 2019-10-31
title Harnessing the power of supercomputers using the PanDA Pilot 2 in the ATLAS Experiment
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2697376