Exploring data merging methods for a distributed processing system
The ALICE experiment at the CERN LHC (Large Hadron Collider) is undertaking a major upgrade during the LHC Long Shutdown 2 in 2019-2021, which includes a new computing system called O$^{2}$ (Online-Offline). The raw data input from the ALICE detectors will increase a hundredfold, up to 3.5 TB/s. By...
Main authors: | Konopka, Piotr; von Haller, Barthélémy |
---|---|
Language: | eng |
Published: | 2023 |
Subjects: | Computing and Computers |
Online access: | https://dx.doi.org/10.1088/1742-6596/2438/1/012038 http://cds.cern.ch/record/2871822 |
_version_ | 1780978569704374272 |
---|---|
author | Konopka, Piotr; von Haller, Barthélémy |
author_facet | Konopka, Piotr; von Haller, Barthélémy |
author_sort | Konopka, Piotr |
collection | CERN |
description | The ALICE experiment at the CERN LHC (Large Hadron Collider) is undertaking a major upgrade during the LHC Long Shutdown 2 in 2019-2021, which includes a new computing system called O$^{2}$ (Online-Offline). The raw data input from the ALICE detectors will increase a hundredfold, up to 3.5 TB/s. By reconstructing the data online, it will be possible to compress the data stream down to 100 GB/s before storing it permanently. The O$^{2}$ software is a message-passing system. It will run on approximately 500 computing nodes performing reconstruction, compression, calibration and quality control of the received data stream. As a direct consequence of having a distributed computing system, locally generated data might be incomplete and could require merging to obtain complete results. This paper presents the O$^{2}$ Mergers, the software designed to match and combine partial data into complete objects synchronously to data taking. Based on a detailed study and results of extensive benchmarks, a qualitative and quantitative comparison of different merging strategies considered to reach the final design and implementation of the software is discussed. |
id | cern-2871822 |
institution | European Organization for Nuclear Research |
language | eng |
publishDate | 2023 |
record_format | invenio |
spellingShingle | Computing and Computers Konopka, Piotr von Haller, Barthélémy Exploring data merging methods for a distributed processing system |
title | Exploring data merging methods for a distributed processing system |
title_full | Exploring data merging methods for a distributed processing system |
title_fullStr | Exploring data merging methods for a distributed processing system |
title_full_unstemmed | Exploring data merging methods for a distributed processing system |
title_short | Exploring data merging methods for a distributed processing system |
title_sort | exploring data merging methods for a distributed processing system |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/2438/1/012038 http://cds.cern.ch/record/2871822 |
work_keys_str_mv | AT konopkapiotr exploringdatamergingmethodsforadistributedprocessingsystem AT vonhallerbarthelemy exploringdatamergingmethodsforadistributedprocessingsystem |