Cargando…

Shared I/O Developments for Run 3 in the ATLAS Experiment

The ATLAS experiment extensively uses multi-process (MP) parallelism to maximize data-throughput especially in I/O intensive workflows, such as the production of Derived Analysis Object Data (DAOD). In this mode, worker processes are spawned at the end of job initialization, thereby sharing memory a...

Descripción completa

Detalles Bibliográficos
Autores principales: Mete, Alaettin Serhan, Van Gemmeren, Peter
Lenguaje:eng
Publicado: 2022
Materias:
Acceso en línea:http://cds.cern.ch/record/2815182
_version_ 1780973494264135680
author Mete, Alaettin Serhan
Van Gemmeren, Peter
author_facet Mete, Alaettin Serhan
Van Gemmeren, Peter
author_sort Mete, Alaettin Serhan
collection CERN
description The ATLAS experiment extensively uses multi-process (MP) parallelism to maximize data-throughput especially in I/O intensive workflows, such as the production of Derived Analysis Object Data (DAOD). In this mode, worker processes are spawned at the end of job initialization, thereby sharing memory allocated thus far. Each worker then loops over a unique set of events and produces its own output file, which in the original implementation needed to be merged at a subsequent step that would be executed serially. In Run 2, SharedWriter was introduced to perform this task on-the-fly, with an additional process merging data from the workers while the job was running, eliminating the need for the extra merging step. Although this approach had been very successful, there was room for improvements, most notably in the event-throughput scaling as a function of the number of workers. This was limited by the fact that the Run 2 version does all data compression within the SharedWriter process. For Run 3, a new version of SharedWriter has been written to address the limitations of the original implementation by moving compression of data to the worker processes. This development also paves the way for using it in a hybrid mode of multi-thread (MT) and MP workflows to maximize the I/O efficiency. In this talk, we will discuss the latest developments in Shared I/O in the ATLAS experiment.
id cern-2815182
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2022
record_format invenio
spelling cern-28151822023-10-06T13:39:37Zhttp://cds.cern.ch/record/2815182engMete, Alaettin SerhanVan Gemmeren, PeterShared I/O Developments for Run 3 in the ATLAS ExperimentParticle Physics - ExperimentThe ATLAS experiment extensively uses multi-process (MP) parallelism to maximize data-throughput especially in I/O intensive workflows, such as the production of Derived Analysis Object Data (DAOD). In this mode, worker processes are spawned at the end of job initialization, thereby sharing memory allocated thus far. Each worker then loops over a unique set of events and produces its own output file, which in the original implementation needed to be merged at a subsequent step that would be executed serially. In Run 2, SharedWriter was introduced to perform this task on-the-fly, with an additional process merging data from the workers while the job was running, eliminating the need for the extra merging step. Although this approach had been very successful, there was room for improvements, most notably in the event-throughput scaling as a function of the number of workers. This was limited by the fact that the Run 2 version does all data compression within the SharedWriter process. For Run 3, a new version of SharedWriter has been written to address the limitations of the original implementation by moving compression of data to the worker processes. This development also paves the way for using it in a hybrid mode of multi-thread (MT) and MP workflows to maximize the I/O efficiency. In this talk, we will discuss the latest developments in Shared I/O in the ATLAS experiment.ATL-SOFT-SLIDE-2022-255oai:cds.cern.ch:28151822022-07-07
spellingShingle Particle Physics - Experiment
Mete, Alaettin Serhan
Van Gemmeren, Peter
Shared I/O Developments for Run 3 in the ATLAS Experiment
title Shared I/O Developments for Run 3 in the ATLAS Experiment
title_full Shared I/O Developments for Run 3 in the ATLAS Experiment
title_fullStr Shared I/O Developments for Run 3 in the ATLAS Experiment
title_full_unstemmed Shared I/O Developments for Run 3 in the ATLAS Experiment
title_short Shared I/O Developments for Run 3 in the ATLAS Experiment
title_sort shared i/o developments for run 3 in the atlas experiment
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2815182
work_keys_str_mv AT metealaettinserhan sharediodevelopmentsforrun3intheatlasexperiment
AT vangemmerenpeter sharediodevelopmentsforrun3intheatlasexperiment