Cargando…

ALICE Run 3 Analysis Framework

In LHC Run 3 the ALICE Collaboration will have to cope with an increase of lead-lead collision data of two orders of magnitude compared to the Run 1 and 2 data-taking periods. The Online-Offline (O2) software framework has been developed to allow for distributed and efficient processing of this unpr...

Descripción completa

Detalles Bibliográficos
Autores principales: Alkin, Anton, Eulisse, Giulio, Grosse-Oetringhaus, Jan Fiete, Hristov, Peter, Kabus, Maja
Lenguaje:eng
Publicado: 2021
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/202125103063
http://cds.cern.ch/record/2814355
_version_ 1780973440678756352
author Alkin, Anton
Eulisse, Giulio
Grosse-Oetringhaus, Jan Fiete
Hristov, Peter
Kabus, Maja
author_facet Alkin, Anton
Eulisse, Giulio
Grosse-Oetringhaus, Jan Fiete
Hristov, Peter
Kabus, Maja
author_sort Alkin, Anton
collection CERN
description In LHC Run 3 the ALICE Collaboration will have to cope with an increase of lead-lead collision data of two orders of magnitude compared to the Run 1 and 2 data-taking periods. The Online-Offline (O2) software framework has been developed to allow for distributed and efficient processing of this unprecedented amount of data. Its design, which is based on a message-passing back end, required the development of a dedicated Analysis Framework that uses the columnar data format provided by Apache Arrow. The O2 Analysis Framework provides a user-friendly high-level interface and hides the complexity of the underlying distributed framework. It allows the users to access and manipulate the data in the new format both in the traditional “event loop” and a declarative approach using bulk processing operations based on Arrow’s Gandiva sub-project. Building on the well-tested system of analysis trains developed by ALICE in Run 1 and 2, the AliHyperloop infrastructure is being developed. It provides a fast and intuitive user interface for running demanding analysis workflows in the GRID environment and on the dedicated Analysis Facility. In this document, we report on the current state and ongoing developments of the Analysis Framework and of AliHyperloop, highlighting the design choices and the benefits of the new system.
id cern-2814355
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2021
record_format invenio
spelling cern-28143552022-07-02T18:07:40Zdoi:10.1051/epjconf/202125103063http://cds.cern.ch/record/2814355engAlkin, AntonEulisse, GiulioGrosse-Oetringhaus, Jan FieteHristov, PeterKabus, MajaALICE Run 3 Analysis FrameworkDetectors and Experimental TechniquesComputing and ComputersIn LHC Run 3 the ALICE Collaboration will have to cope with an increase of lead-lead collision data of two orders of magnitude compared to the Run 1 and 2 data-taking periods. The Online-Offline (O2) software framework has been developed to allow for distributed and efficient processing of this unprecedented amount of data. Its design, which is based on a message-passing back end, required the development of a dedicated Analysis Framework that uses the columnar data format provided by Apache Arrow. The O2 Analysis Framework provides a user-friendly high-level interface and hides the complexity of the underlying distributed framework. It allows the users to access and manipulate the data in the new format both in the traditional “event loop” and a declarative approach using bulk processing operations based on Arrow’s Gandiva sub-project. Building on the well-tested system of analysis trains developed by ALICE in Run 1 and 2, the AliHyperloop infrastructure is being developed. It provides a fast and intuitive user interface for running demanding analysis workflows in the GRID environment and on the dedicated Analysis Facility. In this document, we report on the current state and ongoing developments of the Analysis Framework and of AliHyperloop, highlighting the design choices and the benefits of the new system.oai:cds.cern.ch:28143552021
spellingShingle Detectors and Experimental Techniques
Computing and Computers
Alkin, Anton
Eulisse, Giulio
Grosse-Oetringhaus, Jan Fiete
Hristov, Peter
Kabus, Maja
ALICE Run 3 Analysis Framework
title ALICE Run 3 Analysis Framework
title_full ALICE Run 3 Analysis Framework
title_fullStr ALICE Run 3 Analysis Framework
title_full_unstemmed ALICE Run 3 Analysis Framework
title_short ALICE Run 3 Analysis Framework
title_sort alice run 3 analysis framework
topic Detectors and Experimental Techniques
Computing and Computers
url https://dx.doi.org/10.1051/epjconf/202125103063
http://cds.cern.ch/record/2814355
work_keys_str_mv AT alkinanton alicerun3analysisframework
AT eulissegiulio alicerun3analysisframework
AT grosseoetringhausjanfiete alicerun3analysisframework
AT hristovpeter alicerun3analysisframework
AT kabusmaja alicerun3analysisframework