Cargando…

Framework for custom event sample augmentations for ATLAS analysis data

For HEP event processing, data is typically stored in column-wise synchronized containers, such as most prominently ROOT’s TTree, which have been used for several decades to store by now over 1 exabyte. These containers can combine row-wise association capabilities needed by most HEP event processin...

Descripción completa

Detalles Bibliográficos
Autores principales: Van Gemmeren, Peter, Mete, Alaettin Serhan, Nowak, Marcin Jerzy, Catmore, James, Burzynski, Jackson Carl, Krumnack, Nils Erik, Heinrich, Lukas Alexander
Lenguaje:eng
Publicado: 2023
Materias:
Acceso en línea:http://cds.cern.ch/record/2866625
_version_ 1780978105047842816
author Van Gemmeren, Peter
Mete, Alaettin Serhan
Nowak, Marcin Jerzy
Catmore, James
Burzynski, Jackson Carl
Krumnack, Nils Erik
Heinrich, Lukas Alexander
author_facet Van Gemmeren, Peter
Mete, Alaettin Serhan
Nowak, Marcin Jerzy
Catmore, James
Burzynski, Jackson Carl
Krumnack, Nils Erik
Heinrich, Lukas Alexander
author_sort Van Gemmeren, Peter
collection CERN
description For HEP event processing, data is typically stored in column-wise synchronized containers, such as most prominently ROOT’s TTree, which have been used for several decades to store by now over 1 exabyte. These containers can combine row-wise association capabilities needed by most HEP event processing frameworks (e.g. Athena for ATLAS) with column-wise storage, which typically results in better compression and more efficient support for many analysis use-cases. One disadvantage is that these containers, TTree in the HEP use-case, require to contain the same attributes for each entry/row (representing events), which can make extending the list of attributes very costly in storage, even if those are only required for a small subsample of events. Since the initial design, the ATLAS software framework features powerful navigational infrastructure to allow storing custom data extensions for subsample of events in separate, but synchronized containers. This allows adding event augmentations to ATLAS standard data products (such as DAOD-PHYS or PHYSLITE) avoiding duplication of those core data products, while limiting their size increase. For this functionality, the framework does not rely on any associations made by the I/O technology (i.e. ROOT), however it supports TTree friends and builds the associated index to allow for analysis outside of the ATLAS framework. A prototype based on the Long-Lived Particle search is implemented and preliminary results with this prototype will be presented. At this point, augmented data are stored within the same file as the core data. Storing them in separate files will be investigated in future, as this could provide more flexibility, e.g. certain sites may only want a subset of several augmentations or augmentations can be archived to disk once their analysis is complete.
id cern-2866625
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2023
record_format invenio
spelling cern-28666252023-08-03T20:42:12Zhttp://cds.cern.ch/record/2866625engVan Gemmeren, PeterMete, Alaettin SerhanNowak, Marcin JerzyCatmore, JamesBurzynski, Jackson CarlKrumnack, Nils ErikHeinrich, Lukas AlexanderFramework for custom event sample augmentations for ATLAS analysis dataParticle Physics - ExperimentFor HEP event processing, data is typically stored in column-wise synchronized containers, such as most prominently ROOT’s TTree, which have been used for several decades to store by now over 1 exabyte. These containers can combine row-wise association capabilities needed by most HEP event processing frameworks (e.g. Athena for ATLAS) with column-wise storage, which typically results in better compression and more efficient support for many analysis use-cases. One disadvantage is that these containers, TTree in the HEP use-case, require to contain the same attributes for each entry/row (representing events), which can make extending the list of attributes very costly in storage, even if those are only required for a small subsample of events. Since the initial design, the ATLAS software framework features powerful navigational infrastructure to allow storing custom data extensions for subsample of events in separate, but synchronized containers. This allows adding event augmentations to ATLAS standard data products (such as DAOD-PHYS or PHYSLITE) avoiding duplication of those core data products, while limiting their size increase. For this functionality, the framework does not rely on any associations made by the I/O technology (i.e. ROOT), however it supports TTree friends and builds the associated index to allow for analysis outside of the ATLAS framework. A prototype based on the Long-Lived Particle search is implemented and preliminary results with this prototype will be presented. At this point, augmented data are stored within the same file as the core data. Storing them in separate files will be investigated in future, as this could provide more flexibility, e.g. certain sites may only want a subset of several augmentations or augmentations can be archived to disk once their analysis is complete.ATL-SOFT-PROC-2023-003oai:cds.cern.ch:28666252023-08-02
spellingShingle Particle Physics - Experiment
Van Gemmeren, Peter
Mete, Alaettin Serhan
Nowak, Marcin Jerzy
Catmore, James
Burzynski, Jackson Carl
Krumnack, Nils Erik
Heinrich, Lukas Alexander
Framework for custom event sample augmentations for ATLAS analysis data
title Framework for custom event sample augmentations for ATLAS analysis data
title_full Framework for custom event sample augmentations for ATLAS analysis data
title_fullStr Framework for custom event sample augmentations for ATLAS analysis data
title_full_unstemmed Framework for custom event sample augmentations for ATLAS analysis data
title_short Framework for custom event sample augmentations for ATLAS analysis data
title_sort framework for custom event sample augmentations for atlas analysis data
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2866625
work_keys_str_mv AT vangemmerenpeter frameworkforcustomeventsampleaugmentationsforatlasanalysisdata
AT metealaettinserhan frameworkforcustomeventsampleaugmentationsforatlasanalysisdata
AT nowakmarcinjerzy frameworkforcustomeventsampleaugmentationsforatlasanalysisdata
AT catmorejames frameworkforcustomeventsampleaugmentationsforatlasanalysisdata
AT burzynskijacksoncarl frameworkforcustomeventsampleaugmentationsforatlasanalysisdata
AT krumnacknilserik frameworkforcustomeventsampleaugmentationsforatlasanalysisdata
AT heinrichlukasalexander frameworkforcustomeventsampleaugmentationsforatlasanalysisdata