Cargando…
Shared I/O Developments for Run 3 in the ATLAS Experiment
The ATLAS experiment extensively uses multi-process (MP) parallelism to maximize data-throughput especially in I/O intensive workflows, such as the production of Derived Analysis Object Data (DAOD). In this mode, worker processes are spawned at the end of job initialization, thereby sharing memory a...
Autores principales: | , |
---|---|
Lenguaje: | eng |
Publicado: |
2022
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2815182 |
_version_ | 1780973494264135680 |
---|---|
author | Mete, Alaettin Serhan Van Gemmeren, Peter |
author_facet | Mete, Alaettin Serhan Van Gemmeren, Peter |
author_sort | Mete, Alaettin Serhan |
collection | CERN |
description | The ATLAS experiment extensively uses multi-process (MP) parallelism to maximize data-throughput especially in I/O intensive workflows, such as the production of Derived Analysis Object Data (DAOD). In this mode, worker processes are spawned at the end of job initialization, thereby sharing memory allocated thus far. Each worker then loops over a unique set of events and produces its own output file, which in the original implementation needed to be merged at a subsequent step that would be executed serially. In Run 2, SharedWriter was introduced to perform this task on-the-fly, with an additional process merging data from the workers while the job was running, eliminating the need for the extra merging step. Although this approach had been very successful, there was room for improvements, most notably in the event-throughput scaling as a function of the number of workers. This was limited by the fact that the Run 2 version does all data compression within the SharedWriter process. For Run 3, a new version of SharedWriter has been written to address the limitations of the original implementation by moving compression of data to the worker processes. This development also paves the way for using it in a hybrid mode of multi-thread (MT) and MP workflows to maximize the I/O efficiency. In this talk, we will discuss the latest developments in Shared I/O in the ATLAS experiment. |
id | cern-2815182 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2022 |
record_format | invenio |
spelling | cern-28151822023-10-06T13:39:37Zhttp://cds.cern.ch/record/2815182engMete, Alaettin SerhanVan Gemmeren, PeterShared I/O Developments for Run 3 in the ATLAS ExperimentParticle Physics - ExperimentThe ATLAS experiment extensively uses multi-process (MP) parallelism to maximize data-throughput especially in I/O intensive workflows, such as the production of Derived Analysis Object Data (DAOD). In this mode, worker processes are spawned at the end of job initialization, thereby sharing memory allocated thus far. Each worker then loops over a unique set of events and produces its own output file, which in the original implementation needed to be merged at a subsequent step that would be executed serially. In Run 2, SharedWriter was introduced to perform this task on-the-fly, with an additional process merging data from the workers while the job was running, eliminating the need for the extra merging step. Although this approach had been very successful, there was room for improvements, most notably in the event-throughput scaling as a function of the number of workers. This was limited by the fact that the Run 2 version does all data compression within the SharedWriter process. For Run 3, a new version of SharedWriter has been written to address the limitations of the original implementation by moving compression of data to the worker processes. This development also paves the way for using it in a hybrid mode of multi-thread (MT) and MP workflows to maximize the I/O efficiency. In this talk, we will discuss the latest developments in Shared I/O in the ATLAS experiment.ATL-SOFT-SLIDE-2022-255oai:cds.cern.ch:28151822022-07-07 |
spellingShingle | Particle Physics - Experiment Mete, Alaettin Serhan Van Gemmeren, Peter Shared I/O Developments for Run 3 in the ATLAS Experiment |
title | Shared I/O Developments for Run 3 in the ATLAS Experiment |
title_full | Shared I/O Developments for Run 3 in the ATLAS Experiment |
title_fullStr | Shared I/O Developments for Run 3 in the ATLAS Experiment |
title_full_unstemmed | Shared I/O Developments for Run 3 in the ATLAS Experiment |
title_short | Shared I/O Developments for Run 3 in the ATLAS Experiment |
title_sort | shared i/o developments for run 3 in the atlas experiment |
topic | Particle Physics - Experiment |
url | http://cds.cern.ch/record/2815182 |
work_keys_str_mv | AT metealaettinserhan sharediodevelopmentsforrun3intheatlasexperiment AT vangemmerenpeter sharediodevelopmentsforrun3intheatlasexperiment |