Cargando…
Predicting ALICE Grid throughput using recurrent neural networks
The Worldwide LHC Computing Grid (WLCG) is the infrastructure enabling the storage and processing of the large amount of data generated by the LHC experiments, and in particular the ALICE experiment. With the foreseen increase in the computing requirements of the future High Luminosity LHC experimen...
Autores principales: | , , |
---|---|
Lenguaje: | eng |
Publicado: |
2023
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/2438/1/012059 http://cds.cern.ch/record/2869677 |
_version_ | 1780978298367508480 |
---|---|
author | Popa, Mircea Grigoras, Costin Vallecorsa, Sofia |
author_facet | Popa, Mircea Grigoras, Costin Vallecorsa, Sofia |
author_sort | Popa, Mircea |
collection | CERN |
description | The Worldwide LHC Computing Grid (WLCG) is the infrastructure enabling the storage and processing of the large amount of data generated by the LHC experiments, and in particular the ALICE experiment. With the foreseen increase in the computing requirements of the future High Luminosity LHC experiments, a data placement strategy which increases the efficiency of the WLCG computing infrastructure becomes extremely relevant for the scientific success of the LHC scientific programme. Currently, the data placement at the ALICE Grid computing sites is determined by heuristic algorithms. Optimisation of the data storage could yield substantial benefits in terms of efficiency and time-to-result. This has however proven to be arduous due to the complexity of the problem. In this work we propose a modelisation of the behaviour of the system via principal component analysis, time series analysis and deep learning, starting from the detailed data collected by the MonALISA monitoring system. We show that it is possible to analyse and model the throughput of the ALICE Grid to a level that has not been possible before, comparing the performance of different deep learning architectures based on recurrent neural networks. Analyzing about six weeks of activity, the Grid I/O throughput trend is successfully predicted with a mean relative error of 4%, while the prediction of the throughput itself performs at 5%. |
id | cern-2869677 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2023 |
record_format | invenio |
spelling | cern-28696772023-09-08T19:08:30Zdoi:10.1088/1742-6596/2438/1/012059http://cds.cern.ch/record/2869677engPopa, MirceaGrigoras, CostinVallecorsa, SofiaPredicting ALICE Grid throughput using recurrent neural networksNuclear Physics - ExperimentParticle Physics - ExperimentComputing and ComputersThe Worldwide LHC Computing Grid (WLCG) is the infrastructure enabling the storage and processing of the large amount of data generated by the LHC experiments, and in particular the ALICE experiment. With the foreseen increase in the computing requirements of the future High Luminosity LHC experiments, a data placement strategy which increases the efficiency of the WLCG computing infrastructure becomes extremely relevant for the scientific success of the LHC scientific programme. Currently, the data placement at the ALICE Grid computing sites is determined by heuristic algorithms. Optimisation of the data storage could yield substantial benefits in terms of efficiency and time-to-result. This has however proven to be arduous due to the complexity of the problem. In this work we propose a modelisation of the behaviour of the system via principal component analysis, time series analysis and deep learning, starting from the detailed data collected by the MonALISA monitoring system. We show that it is possible to analyse and model the throughput of the ALICE Grid to a level that has not been possible before, comparing the performance of different deep learning architectures based on recurrent neural networks. Analyzing about six weeks of activity, the Grid I/O throughput trend is successfully predicted with a mean relative error of 4%, while the prediction of the throughput itself performs at 5%.oai:cds.cern.ch:28696772023 |
spellingShingle | Nuclear Physics - Experiment Particle Physics - Experiment Computing and Computers Popa, Mircea Grigoras, Costin Vallecorsa, Sofia Predicting ALICE Grid throughput using recurrent neural networks |
title | Predicting ALICE Grid throughput using recurrent neural networks |
title_full | Predicting ALICE Grid throughput using recurrent neural networks |
title_fullStr | Predicting ALICE Grid throughput using recurrent neural networks |
title_full_unstemmed | Predicting ALICE Grid throughput using recurrent neural networks |
title_short | Predicting ALICE Grid throughput using recurrent neural networks |
title_sort | predicting alice grid throughput using recurrent neural networks |
topic | Nuclear Physics - Experiment Particle Physics - Experiment Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/2438/1/012059 http://cds.cern.ch/record/2869677 |
work_keys_str_mv | AT popamircea predictingalicegridthroughputusingrecurrentneuralnetworks AT grigorascostin predictingalicegridthroughputusingrecurrentneuralnetworks AT vallecorsasofia predictingalicegridthroughputusingrecurrentneuralnetworks |