Cargando…

Predicting ALICE Grid throughput using recurrent neural networks

The Worldwide LHC Computing Grid (WLCG) is the infrastructure enabling the storage and processing of the large amount of data generated by the LHC experiments, and in particular the ALICE experiment. With the foreseen increase in the computing requirements of the future High Luminosity LHC experimen...

Descripción completa

Detalles Bibliográficos
Autores principales: Popa, Mircea, Grigoras, Costin, Vallecorsa, Sofia
Lenguaje:eng
Publicado: 2023
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/2438/1/012059
http://cds.cern.ch/record/2869677
_version_ 1780978298367508480
author Popa, Mircea
Grigoras, Costin
Vallecorsa, Sofia
author_facet Popa, Mircea
Grigoras, Costin
Vallecorsa, Sofia
author_sort Popa, Mircea
collection CERN
description The Worldwide LHC Computing Grid (WLCG) is the infrastructure enabling the storage and processing of the large amount of data generated by the LHC experiments, and in particular the ALICE experiment. With the foreseen increase in the computing requirements of the future High Luminosity LHC experiments, a data placement strategy which increases the efficiency of the WLCG computing infrastructure becomes extremely relevant for the scientific success of the LHC scientific programme. Currently, the data placement at the ALICE Grid computing sites is determined by heuristic algorithms. Optimisation of the data storage could yield substantial benefits in terms of efficiency and time-to-result. This has however proven to be arduous due to the complexity of the problem. In this work we propose a modelisation of the behaviour of the system via principal component analysis, time series analysis and deep learning, starting from the detailed data collected by the MonALISA monitoring system. We show that it is possible to analyse and model the throughput of the ALICE Grid to a level that has not been possible before, comparing the performance of different deep learning architectures based on recurrent neural networks. Analyzing about six weeks of activity, the Grid I/O throughput trend is successfully predicted with a mean relative error of 4%, while the prediction of the throughput itself performs at 5%.
id cern-2869677
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2023
record_format invenio
spelling cern-28696772023-09-08T19:08:30Zdoi:10.1088/1742-6596/2438/1/012059http://cds.cern.ch/record/2869677engPopa, MirceaGrigoras, CostinVallecorsa, SofiaPredicting ALICE Grid throughput using recurrent neural networksNuclear Physics - ExperimentParticle Physics - ExperimentComputing and ComputersThe Worldwide LHC Computing Grid (WLCG) is the infrastructure enabling the storage and processing of the large amount of data generated by the LHC experiments, and in particular the ALICE experiment. With the foreseen increase in the computing requirements of the future High Luminosity LHC experiments, a data placement strategy which increases the efficiency of the WLCG computing infrastructure becomes extremely relevant for the scientific success of the LHC scientific programme. Currently, the data placement at the ALICE Grid computing sites is determined by heuristic algorithms. Optimisation of the data storage could yield substantial benefits in terms of efficiency and time-to-result. This has however proven to be arduous due to the complexity of the problem. In this work we propose a modelisation of the behaviour of the system via principal component analysis, time series analysis and deep learning, starting from the detailed data collected by the MonALISA monitoring system. We show that it is possible to analyse and model the throughput of the ALICE Grid to a level that has not been possible before, comparing the performance of different deep learning architectures based on recurrent neural networks. Analyzing about six weeks of activity, the Grid I/O throughput trend is successfully predicted with a mean relative error of 4%, while the prediction of the throughput itself performs at 5%.oai:cds.cern.ch:28696772023
spellingShingle Nuclear Physics - Experiment
Particle Physics - Experiment
Computing and Computers
Popa, Mircea
Grigoras, Costin
Vallecorsa, Sofia
Predicting ALICE Grid throughput using recurrent neural networks
title Predicting ALICE Grid throughput using recurrent neural networks
title_full Predicting ALICE Grid throughput using recurrent neural networks
title_fullStr Predicting ALICE Grid throughput using recurrent neural networks
title_full_unstemmed Predicting ALICE Grid throughput using recurrent neural networks
title_short Predicting ALICE Grid throughput using recurrent neural networks
title_sort predicting alice grid throughput using recurrent neural networks
topic Nuclear Physics - Experiment
Particle Physics - Experiment
Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/2438/1/012059
http://cds.cern.ch/record/2869677
work_keys_str_mv AT popamircea predictingalicegridthroughputusingrecurrentneuralnetworks
AT grigorascostin predictingalicegridthroughputusingrecurrentneuralnetworks
AT vallecorsasofia predictingalicegridthroughputusingrecurrentneuralnetworks