Cargando…

The Future of PanDA in ATLAS Distributed Computing

Experiments at the Large Hadron Collider (LHC) face unprecedented computing challenges. Heterogeneous resources are distributed worldwide at hundreds of sites, thousands of physicists analyze the data remotely, the volume of processed data is beyond the exabyte scale, while data processing requires...

Descripción completa

Detalles Bibliográficos
Autores principales: De, Kaushik, Klimentov, Alexei, Maeno, Tadashi, Nilsson, Paul, Oleynik, Danila, Panitkin, Sergey, Petrosyan, Artem, Schovancova, Jaroslava, Vaniachine, Alexandre, Wenaus, Torre
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/664/6/062035
http://cds.cern.ch/record/2016653
_version_ 1780946705354588160
author De, Kaushik
Klimentov, Alexei
Maeno, Tadashi
Nilsson, Paul
Oleynik, Danila
Panitkin, Sergey
Petrosyan, Artem
Schovancova, Jaroslava
Vaniachine, Alexandre
Wenaus, Torre
author_facet De, Kaushik
Klimentov, Alexei
Maeno, Tadashi
Nilsson, Paul
Oleynik, Danila
Panitkin, Sergey
Petrosyan, Artem
Schovancova, Jaroslava
Vaniachine, Alexandre
Wenaus, Torre
author_sort De, Kaushik
collection CERN
description Experiments at the Large Hadron Collider (LHC) face unprecedented computing challenges. Heterogeneous resources are distributed worldwide at hundreds of sites, thousands of physicists analyze the data remotely, the volume of processed data is beyond the exabyte scale, while data processing requires more than a few billion hours of computing usage per year. The PanDA (Production and Distributed Analysis) system was developed to meet the scale and complexity of LHC distributed computing for the ATLAS experiment. In the process, the old batch job paradigm of locally managed computing in HEP was discarded in favor of a far more automated, flexible and scalable model. The success of PanDA in ATLAS is leading to widespread adoption and testing by other experiments. PanDA is the first exascale workload management system in HEP, already operating at more than a million computing jobs per day, and processing over an exabyte of data in 2013. There are many new challenges that PanDA will face in the near future, in addition to new challenges of scale, heterogeneity and increasing user base. PanDA will need to handle rapidly changing computing infrastructure, will require factorization of code for easier deployment, will need to incorporate additional information sources including network metrics in decision making, be able to control network circuits, handle dynamically sized workload processing, provide improved visualization, and face many other challenges. In this talk we will focus on the new features, planned or recently implemented, that are relevant to the next decade of distributed computing workload management using PanDA.
id cern-2016653
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling cern-20166532022-08-10T12:53:17Zdoi:10.1088/1742-6596/664/6/062035http://cds.cern.ch/record/2016653engDe, KaushikKlimentov, AlexeiMaeno, TadashiNilsson, PaulOleynik, DanilaPanitkin, SergeyPetrosyan, ArtemSchovancova, JaroslavaVaniachine, AlexandreWenaus, TorreThe Future of PanDA in ATLAS Distributed ComputingParticle Physics - ExperimentExperiments at the Large Hadron Collider (LHC) face unprecedented computing challenges. Heterogeneous resources are distributed worldwide at hundreds of sites, thousands of physicists analyze the data remotely, the volume of processed data is beyond the exabyte scale, while data processing requires more than a few billion hours of computing usage per year. The PanDA (Production and Distributed Analysis) system was developed to meet the scale and complexity of LHC distributed computing for the ATLAS experiment. In the process, the old batch job paradigm of locally managed computing in HEP was discarded in favor of a far more automated, flexible and scalable model. The success of PanDA in ATLAS is leading to widespread adoption and testing by other experiments. PanDA is the first exascale workload management system in HEP, already operating at more than a million computing jobs per day, and processing over an exabyte of data in 2013. There are many new challenges that PanDA will face in the near future, in addition to new challenges of scale, heterogeneity and increasing user base. PanDA will need to handle rapidly changing computing infrastructure, will require factorization of code for easier deployment, will need to incorporate additional information sources including network metrics in decision making, be able to control network circuits, handle dynamically sized workload processing, provide improved visualization, and face many other challenges. In this talk we will focus on the new features, planned or recently implemented, that are relevant to the next decade of distributed computing workload management using PanDA.ATL-SOFT-PROC-2015-047oai:cds.cern.ch:20166532015-05-17
spellingShingle Particle Physics - Experiment
De, Kaushik
Klimentov, Alexei
Maeno, Tadashi
Nilsson, Paul
Oleynik, Danila
Panitkin, Sergey
Petrosyan, Artem
Schovancova, Jaroslava
Vaniachine, Alexandre
Wenaus, Torre
The Future of PanDA in ATLAS Distributed Computing
title The Future of PanDA in ATLAS Distributed Computing
title_full The Future of PanDA in ATLAS Distributed Computing
title_fullStr The Future of PanDA in ATLAS Distributed Computing
title_full_unstemmed The Future of PanDA in ATLAS Distributed Computing
title_short The Future of PanDA in ATLAS Distributed Computing
title_sort future of panda in atlas distributed computing
topic Particle Physics - Experiment
url https://dx.doi.org/10.1088/1742-6596/664/6/062035
http://cds.cern.ch/record/2016653
work_keys_str_mv AT dekaushik thefutureofpandainatlasdistributedcomputing
AT klimentovalexei thefutureofpandainatlasdistributedcomputing
AT maenotadashi thefutureofpandainatlasdistributedcomputing
AT nilssonpaul thefutureofpandainatlasdistributedcomputing
AT oleynikdanila thefutureofpandainatlasdistributedcomputing
AT panitkinsergey thefutureofpandainatlasdistributedcomputing
AT petrosyanartem thefutureofpandainatlasdistributedcomputing
AT schovancovajaroslava thefutureofpandainatlasdistributedcomputing
AT vaniachinealexandre thefutureofpandainatlasdistributedcomputing
AT wenaustorre thefutureofpandainatlasdistributedcomputing
AT dekaushik futureofpandainatlasdistributedcomputing
AT klimentovalexei futureofpandainatlasdistributedcomputing
AT maenotadashi futureofpandainatlasdistributedcomputing
AT nilssonpaul futureofpandainatlasdistributedcomputing
AT oleynikdanila futureofpandainatlasdistributedcomputing
AT panitkinsergey futureofpandainatlasdistributedcomputing
AT petrosyanartem futureofpandainatlasdistributedcomputing
AT schovancovajaroslava futureofpandainatlasdistributedcomputing
AT vaniachinealexandre futureofpandainatlasdistributedcomputing
AT wenaustorre futureofpandainatlasdistributedcomputing