Cargando…

Exploring data merging methods for a distributed processing system

The ALICE experiment at the CERN LHC (Large Hadron Collider) is undertaking a major upgrade during the LHC Long Shutdown 2 in 2019-2021, which includes a new computing system called O$^{2}$ (Online-Offline). The raw data input from the ALICE detectors will increase a hundredfold, up to 3.5 TB/s. By...

Descripción completa

Detalles Bibliográficos
Autores principales: Konopka, Piotr, von Haller, Barthélémy
Lenguaje:eng
Publicado: 2023
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/2438/1/012038
http://cds.cern.ch/record/2871822
_version_ 1780978569704374272
author Konopka, Piotr
von Haller, Barthélémy
author_facet Konopka, Piotr
von Haller, Barthélémy
author_sort Konopka, Piotr
collection CERN
description The ALICE experiment at the CERN LHC (Large Hadron Collider) is undertaking a major upgrade during the LHC Long Shutdown 2 in 2019-2021, which includes a new computing system called O$^{2}$ (Online-Offline). The raw data input from the ALICE detectors will increase a hundredfold, up to 3.5 TB/s. By reconstructing the data online, it will be possible to compress the data stream down to 100 GB/s before storing it permanently.The O$^{2}$ software is a message-passing system. It will run on approximately 500 computing nodes performing reconstruction, compression, calibration and quality control of the received data stream. As a direct consequence of having a distributed computing system, locally generated data might be incomplete and could require merging to obtain complete results.This paper presents the O$^{2}$ Mergers, the software designed to match and combine partial data into complete objects synchronously to data taking. Based on a detailed study and results of extensive benchmarks, a qualitative and quantitative comparison of different merging strategies considered to reach the final design and implementation of the software is discussed.
id cern-2871822
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2023
record_format invenio
spelling cern-28718222023-09-20T21:01:03Zdoi:10.1088/1742-6596/2438/1/012038http://cds.cern.ch/record/2871822engKonopka, Piotrvon Haller, BarthélémyExploring data merging methods for a distributed processing systemComputing and ComputersThe ALICE experiment at the CERN LHC (Large Hadron Collider) is undertaking a major upgrade during the LHC Long Shutdown 2 in 2019-2021, which includes a new computing system called O$^{2}$ (Online-Offline). The raw data input from the ALICE detectors will increase a hundredfold, up to 3.5 TB/s. By reconstructing the data online, it will be possible to compress the data stream down to 100 GB/s before storing it permanently.The O$^{2}$ software is a message-passing system. It will run on approximately 500 computing nodes performing reconstruction, compression, calibration and quality control of the received data stream. As a direct consequence of having a distributed computing system, locally generated data might be incomplete and could require merging to obtain complete results.This paper presents the O$^{2}$ Mergers, the software designed to match and combine partial data into complete objects synchronously to data taking. Based on a detailed study and results of extensive benchmarks, a qualitative and quantitative comparison of different merging strategies considered to reach the final design and implementation of the software is discussed.oai:cds.cern.ch:28718222023
spellingShingle Computing and Computers
Konopka, Piotr
von Haller, Barthélémy
Exploring data merging methods for a distributed processing system
title Exploring data merging methods for a distributed processing system
title_full Exploring data merging methods for a distributed processing system
title_fullStr Exploring data merging methods for a distributed processing system
title_full_unstemmed Exploring data merging methods for a distributed processing system
title_short Exploring data merging methods for a distributed processing system
title_sort exploring data merging methods for a distributed processing system
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/2438/1/012038
http://cds.cern.ch/record/2871822
work_keys_str_mv AT konopkapiotr exploringdatamergingmethodsforadistributedprocessingsystem
AT vonhallerbarthelemy exploringdatamergingmethodsforadistributedprocessingsystem