Cargando…

hep_tables: Heterogeneous Array Programming for HEP

<!--HTML-->Array operations are one of the most concise ways of expressing common filtering and simple aggregation operations that is the hallmark of the first step of a particle physics analysis: selection, filtering, basic vector operations, and filling histograms. The High Luminosity run of...

Descripción completa

Detalles Bibliográficos
Autor principal: Watts, Gordon
Lenguaje:eng
Publicado: 2021
Materias:
Acceso en línea:http://cds.cern.ch/record/2767252
_version_ 1780971283197984768
author Watts, Gordon
author_facet Watts, Gordon
author_sort Watts, Gordon
collection CERN
description <!--HTML-->Array operations are one of the most concise ways of expressing common filtering and simple aggregation operations that is the hallmark of the first step of a particle physics analysis: selection, filtering, basic vector operations, and filling histograms. The High Luminosity run of the Large Hadron Collider (HL-LHC), scheduled to start in 2026, will require physicists to regularly skim datasets that are over a PB in size, and repeatedly run over datasets that are 100's of TB's – too big to fit in memory. Declarative programming techniques are a way of separating the intent of the physicist from the mechanics of finding the data, processing the data, and using distributed computing to process it efficiently that is required to extract the plot or data desired in a timely fashion. This paper describes a prototype library that provides a framework for different sub-systems to cooperate in producing this data, using an array-programming declarative interface. This prototype has a servicex data-delivery sub-system and an \awkward array sub-system cooperating to generate requested data. The ServiceX system runs against ATLAS xAOD data.
id cern-2767252
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2021
record_format invenio
spelling cern-27672522022-11-02T22:25:37Zhttp://cds.cern.ch/record/2767252engWatts, Gordonhep_tables: Heterogeneous Array Programming for HEP25th International Conference on Computing in High Energy & Nuclear PhysicsConferences<!--HTML-->Array operations are one of the most concise ways of expressing common filtering and simple aggregation operations that is the hallmark of the first step of a particle physics analysis: selection, filtering, basic vector operations, and filling histograms. The High Luminosity run of the Large Hadron Collider (HL-LHC), scheduled to start in 2026, will require physicists to regularly skim datasets that are over a PB in size, and repeatedly run over datasets that are 100's of TB's – too big to fit in memory. Declarative programming techniques are a way of separating the intent of the physicist from the mechanics of finding the data, processing the data, and using distributed computing to process it efficiently that is required to extract the plot or data desired in a timely fashion. This paper describes a prototype library that provides a framework for different sub-systems to cooperate in producing this data, using an array-programming declarative interface. This prototype has a servicex data-delivery sub-system and an \awkward array sub-system cooperating to generate requested data. The ServiceX system runs against ATLAS xAOD data.oai:cds.cern.ch:27672522021
spellingShingle Conferences
Watts, Gordon
hep_tables: Heterogeneous Array Programming for HEP
title hep_tables: Heterogeneous Array Programming for HEP
title_full hep_tables: Heterogeneous Array Programming for HEP
title_fullStr hep_tables: Heterogeneous Array Programming for HEP
title_full_unstemmed hep_tables: Heterogeneous Array Programming for HEP
title_short hep_tables: Heterogeneous Array Programming for HEP
title_sort hep_tables: heterogeneous array programming for hep
topic Conferences
url http://cds.cern.ch/record/2767252
work_keys_str_mv AT wattsgordon heptablesheterogeneousarrayprogrammingforhep
AT wattsgordon 25thinternationalconferenceoncomputinginhighenergynuclearphysics