Cargando…

Enabling Data Intensive Science on Supercomputers for High Energy Physics R&D; Projects in HL-LHC Era

The ATLAS experiment at CERN’s Large Hadron Collider uses the Worldwide LHC Computing Grid, the WLCG, for its distributed computing infrastructure. Through the workload management system PanDA and the distributed data management system Rucio, ATLAS provides seamless access to hundreds of WLCG grid a...

Descripción completa

Detalles Bibliográficos
Autores principales: Klimentov, Alexei, Benjamin, Douglas, Di Girolamo, Alessandro, De, Kaushik, Elmsheuser, Johannes, Filipcic, Andrej, Kiryanov, Andrey, Oleynik, Danila, Wells, Jack C, Zarochentsev, Andrey, Zhao, Xin
Lenguaje:eng
Publicado: 2020
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/202022601007
http://cds.cern.ch/record/2714103
_version_ 1780965386608443392
author Klimentov, Alexei
Benjamin, Douglas
Di Girolamo, Alessandro
De, Kaushik
Elmsheuser, Johannes
Filipcic, Andrej
Kiryanov, Andrey
Oleynik, Danila
Wells, Jack C
Zarochentsev, Andrey
Zhao, Xin
author_facet Klimentov, Alexei
Benjamin, Douglas
Di Girolamo, Alessandro
De, Kaushik
Elmsheuser, Johannes
Filipcic, Andrej
Kiryanov, Andrey
Oleynik, Danila
Wells, Jack C
Zarochentsev, Andrey
Zhao, Xin
author_sort Klimentov, Alexei
collection CERN
description The ATLAS experiment at CERN’s Large Hadron Collider uses the Worldwide LHC Computing Grid, the WLCG, for its distributed computing infrastructure. Through the workload management system PanDA and the distributed data management system Rucio, ATLAS provides seamless access to hundreds of WLCG grid and cloud based resources that are distributed worldwide, to thousands of physicists. PanDA annually processes more than an exabyte of data using an average of 350,000 distributed batch slots, to enable hundreds of new scientific results from ATLAS. However, the resources available to the experiment have been insufficient to meet ATLAS simulation needs over the past few years as the volume of data from the LHC has grown. The problem willbe even more severe for the next LHC phases. High Luminosity LHC will be a multiexabyte challenge where the envisaged Storage and Compute needs are a factor 10 to 100 above the expected technology evolution. The High Energy Physics (HEP) community needs to evolve current computing and data organization models in order to introduce changes in the way it uses and manages the infrastructure, focused on optimizations to bring performance and efficiency not forgetting simplification of operations. In this paper we highlight recent R&D; projects in HEP related to data lake prototype, federated data storage and data carousel.
id oai-inspirehep.net-1776918
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
record_format invenio
spelling oai-inspirehep.net-17769182022-08-17T13:00:53Zdoi:10.1051/epjconf/202022601007http://cds.cern.ch/record/2714103engKlimentov, AlexeiBenjamin, DouglasDi Girolamo, AlessandroDe, KaushikElmsheuser, JohannesFilipcic, AndrejKiryanov, AndreyOleynik, DanilaWells, Jack CZarochentsev, AndreyZhao, XinEnabling Data Intensive Science on Supercomputers for High Energy Physics R&D; Projects in HL-LHC EraComputing and ComputersParticle Physics - ExperimentThe ATLAS experiment at CERN’s Large Hadron Collider uses the Worldwide LHC Computing Grid, the WLCG, for its distributed computing infrastructure. Through the workload management system PanDA and the distributed data management system Rucio, ATLAS provides seamless access to hundreds of WLCG grid and cloud based resources that are distributed worldwide, to thousands of physicists. PanDA annually processes more than an exabyte of data using an average of 350,000 distributed batch slots, to enable hundreds of new scientific results from ATLAS. However, the resources available to the experiment have been insufficient to meet ATLAS simulation needs over the past few years as the volume of data from the LHC has grown. The problem willbe even more severe for the next LHC phases. High Luminosity LHC will be a multiexabyte challenge where the envisaged Storage and Compute needs are a factor 10 to 100 above the expected technology evolution. The High Energy Physics (HEP) community needs to evolve current computing and data organization models in order to introduce changes in the way it uses and manages the infrastructure, focused on optimizations to bring performance and efficiency not forgetting simplification of operations. In this paper we highlight recent R&D; projects in HEP related to data lake prototype, federated data storage and data carousel.oai:inspirehep.net:17769182020
spellingShingle Computing and Computers
Particle Physics - Experiment
Klimentov, Alexei
Benjamin, Douglas
Di Girolamo, Alessandro
De, Kaushik
Elmsheuser, Johannes
Filipcic, Andrej
Kiryanov, Andrey
Oleynik, Danila
Wells, Jack C
Zarochentsev, Andrey
Zhao, Xin
Enabling Data Intensive Science on Supercomputers for High Energy Physics R&D; Projects in HL-LHC Era
title Enabling Data Intensive Science on Supercomputers for High Energy Physics R&D; Projects in HL-LHC Era
title_full Enabling Data Intensive Science on Supercomputers for High Energy Physics R&D; Projects in HL-LHC Era
title_fullStr Enabling Data Intensive Science on Supercomputers for High Energy Physics R&D; Projects in HL-LHC Era
title_full_unstemmed Enabling Data Intensive Science on Supercomputers for High Energy Physics R&D; Projects in HL-LHC Era
title_short Enabling Data Intensive Science on Supercomputers for High Energy Physics R&D; Projects in HL-LHC Era
title_sort enabling data intensive science on supercomputers for high energy physics r&d; projects in hl-lhc era
topic Computing and Computers
Particle Physics - Experiment
url https://dx.doi.org/10.1051/epjconf/202022601007
http://cds.cern.ch/record/2714103
work_keys_str_mv AT klimentovalexei enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT benjamindouglas enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT digirolamoalessandro enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT dekaushik enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT elmsheuserjohannes enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT filipcicandrej enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT kiryanovandrey enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT oleynikdanila enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT wellsjackc enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT zarochentsevandrey enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera
AT zhaoxin enablingdataintensivescienceonsupercomputersforhighenergyphysicsrdprojectsinhllhcera