Cargando…
A caching mechanism to exploit object store speed in High Energy Physics analysis
AbstractData analysis workflows in High Energy Physics (HEP) read data written in the ROOT columnar format. Such data has traditionally been stored in files that are often read via the network from remote storage facilities, which represents a performance penalty especially for data processing workf...
Autores principales: | , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2022
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1007/s10586-022-03757-2 http://cds.cern.ch/record/2852823 |
_version_ | 1780977168891772928 |
---|---|
author | Padulano, Vincenzo Eduardo Saavedra, Enric Tejedor Alonso-Jorda, Pedro Gomez, Javier Lopez Blomer, Jakob |
author_facet | Padulano, Vincenzo Eduardo Saavedra, Enric Tejedor Alonso-Jorda, Pedro Gomez, Javier Lopez Blomer, Jakob |
author_sort | Padulano, Vincenzo Eduardo |
collection | CERN |
description | AbstractData analysis workflows in High Energy Physics (HEP) read data written in the ROOT columnar format. Such data has traditionally been stored in files that are often read via the network from remote storage facilities, which represents a performance penalty especially for data processing workflows that are I/O bound. To address that issue, this paper presents a new caching mechanism, implemented in the I/O subsystem of ROOT, which is independent of the storage backend used to write the dataset. Notably, it can be used to leverage the speed of high-bandwidth, low-latency object stores. The performance of this caching approach is evaluated by running a real physics analysis on an Intel DAOS cluster, both on a single node and distributed on multiple nodes. |
id | cern-2852823 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2022 |
record_format | invenio |
spelling | cern-28528232023-03-18T19:34:45Zdoi:10.1007/s10586-022-03757-2http://cds.cern.ch/record/2852823engPadulano, Vincenzo EduardoSaavedra, Enric TejedorAlonso-Jorda, PedroGomez, Javier LopezBlomer, JakobA caching mechanism to exploit object store speed in High Energy Physics analysisComputing and ComputersParticle Physics - ExperimentAbstractData analysis workflows in High Energy Physics (HEP) read data written in the ROOT columnar format. Such data has traditionally been stored in files that are often read via the network from remote storage facilities, which represents a performance penalty especially for data processing workflows that are I/O bound. To address that issue, this paper presents a new caching mechanism, implemented in the I/O subsystem of ROOT, which is independent of the storage backend used to write the dataset. Notably, it can be used to leverage the speed of high-bandwidth, low-latency object stores. The performance of this caching approach is evaluated by running a real physics analysis on an Intel DAOS cluster, both on a single node and distributed on multiple nodes.oai:cds.cern.ch:28528232022 |
spellingShingle | Computing and Computers Particle Physics - Experiment Padulano, Vincenzo Eduardo Saavedra, Enric Tejedor Alonso-Jorda, Pedro Gomez, Javier Lopez Blomer, Jakob A caching mechanism to exploit object store speed in High Energy Physics analysis |
title | A caching mechanism to exploit object store speed in High Energy Physics analysis |
title_full | A caching mechanism to exploit object store speed in High Energy Physics analysis |
title_fullStr | A caching mechanism to exploit object store speed in High Energy Physics analysis |
title_full_unstemmed | A caching mechanism to exploit object store speed in High Energy Physics analysis |
title_short | A caching mechanism to exploit object store speed in High Energy Physics analysis |
title_sort | caching mechanism to exploit object store speed in high energy physics analysis |
topic | Computing and Computers Particle Physics - Experiment |
url | https://dx.doi.org/10.1007/s10586-022-03757-2 http://cds.cern.ch/record/2852823 |
work_keys_str_mv | AT padulanovincenzoeduardo acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis AT saavedraenrictejedor acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis AT alonsojordapedro acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis AT gomezjavierlopez acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis AT blomerjakob acachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis AT padulanovincenzoeduardo cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis AT saavedraenrictejedor cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis AT alonsojordapedro cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis AT gomezjavierlopez cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis AT blomerjakob cachingmechanismtoexploitobjectstorespeedinhighenergyphysicsanalysis |