Cargando…

A caching mechanism to exploit object store speed in High Energy Physics analysis

AbstractData analysis workflows in High Energy Physics (HEP) read data written in the ROOT columnar format. Such data has traditionally been stored in files that are often read via the network from remote storage facilities, which represents a performance penalty especially for data processing workf...

Descripción completa

Detalles Bibliográficos
Autores principales: Padulano, Vincenzo Eduardo, Saavedra, Enric Tejedor, Alonso-Jorda, Pedro, Gomez, Javier Lopez, Blomer, Jakob
Lenguaje:eng
Publicado: 2022
Materias:
Acceso en línea:https://dx.doi.org/10.1007/s10586-022-03757-2
http://cds.cern.ch/record/2852823
_version_ 1780977168891772928
author Padulano, Vincenzo Eduardo
Saavedra, Enric Tejedor
Alonso-Jorda, Pedro
Gomez, Javier Lopez
Blomer, Jakob
author_facet Padulano, Vincenzo Eduardo
Saavedra, Enric Tejedor
Alonso-Jorda, Pedro
Gomez, Javier Lopez
Blomer, Jakob
author_sort Padulano, Vincenzo Eduardo
collection CERN
description AbstractData analysis workflows in High Energy Physics (HEP) read data written in the ROOT columnar format. Such data has traditionally been stored in files that are often read via the network from remote storage facilities, which represents a performance penalty especially for data processing workflows that are I/O bound. To address that issue, this paper presents a new caching mechanism, implemented in the I/O subsystem of ROOT, which is independent of the storage backend used to write the dataset. Notably, it can be used to leverage the speed of high-bandwidth, low-latency object stores. The performance of this caching approach is evaluated by running a real physics analysis on an Intel DAOS cluster, both on a single node and distributed on multiple nodes.
id cern-2852823
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2022
record_format invenio
spelling cern-28528232023-03-18T19:34:45Zdoi:10.1007/s10586-022-03757-2http://cds.cern.ch/record/2852823engPadulano, Vincenzo EduardoSaavedra, Enric TejedorAlonso-Jorda, PedroGomez, Javier LopezBlomer, JakobA caching mechanism to exploit object store speed in High Energy Physics analysisComputing and ComputersParticle Physics - ExperimentAbstractData analysis workflows in High Energy Physics (HEP) read data written in the ROOT columnar format. Such data has traditionally been stored in files that are often read via the network from remote storage facilities, which represents a performance penalty especially for data processing workflows that are I/O bound. To address that issue, this paper presents a new caching mechanism, implemented in the I/O subsystem of ROOT, which is independent of the storage backend used to write the dataset. Notably, it can be used to leverage the speed of high-bandwidth, low-latency object stores. The performance of this caching approach is evaluated by running a real physics analysis on an Intel DAOS cluster, both on a single node and distributed on multiple nodes.oai:cds.cern.ch:28528232022
spellingShingle Computing and Computers
Particle Physics - Experiment
Padulano, Vincenzo Eduardo
Saavedra, Enric Tejedor
Alonso-Jorda, Pedro
Gomez, Javier Lopez
Blomer, Jakob
A caching mechanism to exploit object store speed in High Energy Physics analysis
title A caching mechanism to exploit object store speed in High Energy Physics analysis
title_full A caching mechanism to exploit object store speed in High Energy Physics analysis
title_fullStr A caching mechanism to exploit object store speed in High Energy Physics analysis
title_full_unstemmed A caching mechanism to exploit object store speed in High Energy Physics analysis
title_short A caching mechanism to exploit object store speed in High Energy Physics analysis
title_sort caching mechanism to exploit object store speed in high energy physics analysis
topic Computing and Computers
Particle Physics - Experiment
url https://dx.doi.org/10.1007/s10586-022-03757-2
http://cds.cern.ch/record/2852823
work_keys_str_mv AT padulanovincenzoeduardo acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis
AT saavedraenrictejedor acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis
AT alonsojordapedro acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis
AT gomezjavierlopez acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis
AT blomerjakob acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis
AT padulanovincenzoeduardo cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis
AT saavedraenrictejedor cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis
AT alonsojordapedro cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis
AT gomezjavierlopez cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis
AT blomerjakob cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis