Cargando…

Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers

Operational analytics is the direction of research related to the analysis of the current state of computing processes and the prediction of the future in order to anticipate imbalances and take timely measures to stabilize a complex system. There are two relevant areas in ATLAS Distributed Computin...

Descripción completa

Detalles Bibliográficos
Autores principales: Grigoryeva, Maria, Klimentov, Alexei, Schulz, Markus, Shubin, Mikhail
Lenguaje:eng
Publicado: 2023
Materias:
Acceso en línea:http://cds.cern.ch/record/2858004
_version_ 1780977598103289856
author Grigoryeva, Maria
Klimentov, Alexei
Schulz, Markus
Shubin, Mikhail
author_facet Grigoryeva, Maria
Klimentov, Alexei
Schulz, Markus
Shubin, Mikhail
author_sort Grigoryeva, Maria
collection CERN
description Operational analytics is the direction of research related to the analysis of the current state of computing processes and the prediction of the future in order to anticipate imbalances and take timely measures to stabilize a complex system. There are two relevant areas in ATLAS Distributed Computing that are currently in the focus of studies: end-user physics analysis including the forecast of samples of data popularity among users, and ranking of WLCG centers for user analysis tasks. Studies in these areas are non-trivial and require detailed knowledge of all boundary conditions, which are usually very many in large-scale distributed computing infrastructures. Forecasts of data popularity are impossible without the categorization of user tasks by their types (data transformation or physics analysis), which do not always appear on the surface but may induce noise, which introduces significant distortions for predictive analysis. Ranking the WLCG resources is also a challenging task as it is necessary to find a balance between the workload of the resource, its performance, the waiting time for jobs on it, as well as the volume of jobs that it processes. This is especially difficult in a heterogeneous computing environment, where legacy resources are used along with modern high-performance machines. We will look at these areas of research in detail and discuss what tools and methods we use in our work, demonstrating the results that we already have. The difficulties we face and how we solve them will also be described.
id cern-2858004
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2023
record_format invenio
spelling cern-28580042023-05-07T19:02:07Zhttp://cds.cern.ch/record/2858004engGrigoryeva, MariaKlimentov, AlexeiSchulz, MarkusShubin, MikhailOperational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG CentersParticle Physics - ExperimentOperational analytics is the direction of research related to the analysis of the current state of computing processes and the prediction of the future in order to anticipate imbalances and take timely measures to stabilize a complex system. There are two relevant areas in ATLAS Distributed Computing that are currently in the focus of studies: end-user physics analysis including the forecast of samples of data popularity among users, and ranking of WLCG centers for user analysis tasks. Studies in these areas are non-trivial and require detailed knowledge of all boundary conditions, which are usually very many in large-scale distributed computing infrastructures. Forecasts of data popularity are impossible without the categorization of user tasks by their types (data transformation or physics analysis), which do not always appear on the surface but may induce noise, which introduces significant distortions for predictive analysis. Ranking the WLCG resources is also a challenging task as it is necessary to find a balance between the workload of the resource, its performance, the waiting time for jobs on it, as well as the volume of jobs that it processes. This is especially difficult in a heterogeneous computing environment, where legacy resources are used along with modern high-performance machines. We will look at these areas of research in detail and discuss what tools and methods we use in our work, demonstrating the results that we already have. The difficulties we face and how we solve them will also be described.ATL-SOFT-SLIDE-2023-167oai:cds.cern.ch:28580042023-05-07
spellingShingle Particle Physics - Experiment
Grigoryeva, Maria
Klimentov, Alexei
Schulz, Markus
Shubin, Mikhail
Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers
title Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers
title_full Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers
title_fullStr Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers
title_full_unstemmed Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers
title_short Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers
title_sort operational analytics studies for atlas distributed computing: data popularity forecast and ranking of the wlcg centers
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2858004
work_keys_str_mv AT grigoryevamaria operationalanalyticsstudiesforatlasdistributedcomputingdatapopularityforecastandrankingofthewlcgcenters
AT klimentovalexei operationalanalyticsstudiesforatlasdistributedcomputingdatapopularityforecastandrankingofthewlcgcenters
AT schulzmarkus operationalanalyticsstudiesforatlasdistributedcomputingdatapopularityforecastandrankingofthewlcgcenters
AT shubinmikhail operationalanalyticsstudiesforatlasdistributedcomputingdatapopularityforecastandrankingofthewlcgcenters