Cargando…
Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers
Operational analytics is the direction of research related to the analysis of the current state of computing processes and the prediction of the future in order to anticipate imbalances and take timely measures to stabilize a complex system. There are two relevant areas in ATLAS Distributed Computin...
Autores principales: | , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2023
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2858004 |
_version_ | 1780977598103289856 |
---|---|
author | Grigoryeva, Maria Klimentov, Alexei Schulz, Markus Shubin, Mikhail |
author_facet | Grigoryeva, Maria Klimentov, Alexei Schulz, Markus Shubin, Mikhail |
author_sort | Grigoryeva, Maria |
collection | CERN |
description | Operational analytics is the direction of research related to the analysis of the current state of computing processes and the prediction of the future in order to anticipate imbalances and take timely measures to stabilize a complex system. There are two relevant areas in ATLAS Distributed Computing that are currently in the focus of studies: end-user physics analysis including the forecast of samples of data popularity among users, and ranking of WLCG centers for user analysis tasks. Studies in these areas are non-trivial and require detailed knowledge of all boundary conditions, which are usually very many in large-scale distributed computing infrastructures. Forecasts of data popularity are impossible without the categorization of user tasks by their types (data transformation or physics analysis), which do not always appear on the surface but may induce noise, which introduces significant distortions for predictive analysis. Ranking the WLCG resources is also a challenging task as it is necessary to find a balance between the workload of the resource, its performance, the waiting time for jobs on it, as well as the volume of jobs that it processes. This is especially difficult in a heterogeneous computing environment, where legacy resources are used along with modern high-performance machines. We will look at these areas of research in detail and discuss what tools and methods we use in our work, demonstrating the results that we already have. The difficulties we face and how we solve them will also be described. |
id | cern-2858004 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2023 |
record_format | invenio |
spelling | cern-28580042023-05-07T19:02:07Zhttp://cds.cern.ch/record/2858004engGrigoryeva, MariaKlimentov, AlexeiSchulz, MarkusShubin, MikhailOperational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG CentersParticle Physics - ExperimentOperational analytics is the direction of research related to the analysis of the current state of computing processes and the prediction of the future in order to anticipate imbalances and take timely measures to stabilize a complex system. There are two relevant areas in ATLAS Distributed Computing that are currently in the focus of studies: end-user physics analysis including the forecast of samples of data popularity among users, and ranking of WLCG centers for user analysis tasks. Studies in these areas are non-trivial and require detailed knowledge of all boundary conditions, which are usually very many in large-scale distributed computing infrastructures. Forecasts of data popularity are impossible without the categorization of user tasks by their types (data transformation or physics analysis), which do not always appear on the surface but may induce noise, which introduces significant distortions for predictive analysis. Ranking the WLCG resources is also a challenging task as it is necessary to find a balance between the workload of the resource, its performance, the waiting time for jobs on it, as well as the volume of jobs that it processes. This is especially difficult in a heterogeneous computing environment, where legacy resources are used along with modern high-performance machines. We will look at these areas of research in detail and discuss what tools and methods we use in our work, demonstrating the results that we already have. The difficulties we face and how we solve them will also be described.ATL-SOFT-SLIDE-2023-167oai:cds.cern.ch:28580042023-05-07 |
spellingShingle | Particle Physics - Experiment Grigoryeva, Maria Klimentov, Alexei Schulz, Markus Shubin, Mikhail Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers |
title | Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers |
title_full | Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers |
title_fullStr | Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers |
title_full_unstemmed | Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers |
title_short | Operational Analytics Studies for ATLAS Distributed Computing: Data Popularity Forecast and Ranking of the WLCG Centers |
title_sort | operational analytics studies for atlas distributed computing: data popularity forecast and ranking of the wlcg centers |
topic | Particle Physics - Experiment |
url | http://cds.cern.ch/record/2858004 |
work_keys_str_mv | AT grigoryevamaria operationalanalyticsstudiesforatlasdistributedcomputingdatapopularityforecastandrankingofthewlcgcenters AT klimentovalexei operationalanalyticsstudiesforatlasdistributedcomputingdatapopularityforecastandrankingofthewlcgcenters AT schulzmarkus operationalanalyticsstudiesforatlasdistributedcomputingdatapopularityforecastandrankingofthewlcgcenters AT shubinmikhail operationalanalyticsstudiesforatlasdistributedcomputingdatapopularityforecastandrankingofthewlcgcenters |