Cargando…

Reinforcement Learning for Smart Caching at the CMS experiment

In the near future, High Energy Physics experiments’ storage and computing needs will go far above what can be achieved by only scaling current computing models or current infrastructures. Considering the LHC case, for 10 years a federated infrastructure (Worldwide LHC Computing Grid, WLCG) has been...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tedeschi, Tommaso, Tracolli, Mirco, Ciangottini, Diego, Spiga, Daniele, Storchi, Loriano, Baioletti, Marco, Poggioni, Valentina
Lenguaje:	eng
Publicado:	2021
Materias:	Computing and Computers
Acceso en línea:	https://dx.doi.org/10.22323/1.378.0009 http://cds.cern.ch/record/2816665

_version_	1780973612279267328
author	Tedeschi, Tommaso Tracolli, Mirco Ciangottini, Diego Spiga, Daniele Storchi, Loriano Baioletti, Marco Poggioni, Valentina
author_facet	Tedeschi, Tommaso Tracolli, Mirco Ciangottini, Diego Spiga, Daniele Storchi, Loriano Baioletti, Marco Poggioni, Valentina
author_sort	Tedeschi, Tommaso
collection	CERN
description	In the near future, High Energy Physics experiments’ storage and computing needs will go far above what can be achieved by only scaling current computing models or current infrastructures. Considering the LHC case, for 10 years a federated infrastructure (Worldwide LHC Computing Grid, WLCG) has been successfully developed. Nevertheless, the High Luminosity (HL-LHC) scenario is forcing the WLCG community to dig for innovative solutions. In this landscape, one of the initiatives is the exploitation of Data Lakes as a solution to improve the Data and Storage management. The current Data Lake model foresees data caching to play a central role as a technical solution to reduce the impact of latency and network load. Moreover, even higher efficiency can be achieved through a smart caching algorithm: this motivates the development of an AI-based approach to the caching problem. In this work, a Reinforcement Learning-based cache model (named QCACHE) is applied in the CMS experiment context. More specifically, we focused our attention on the optimization of both cache performances and cache management costs. The QCACHE system is based on two distinct Q-Learning (or Deep Q-Learning) agents seeking to find the best action to take given the current state. More explicitly, they try to learn a policy that maximizes the total reward (i.e. hit or miss occurring in a given time span). While the addition Agent is taking care of all the cache writing requests, clearly the eviction agent deals with the decision to keep or to delete files in the cache. We will present an overview of the QCACHE framework an the results in terms of cache performances, obtained using using “Real-world” data, will be compared respect to standard replacement policies (i.e. we used historical data requests aggregation used to predict dataset popularity filtered for Italian region). Moreover, we will show the planned subsequent evolution of the framework.
id	cern-2816665
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2021
record_format	invenio
spelling	cern-28166652022-07-27T08:30:54Zdoi:10.22323/1.378.0009http://cds.cern.ch/record/2816665engTedeschi, TommasoTracolli, MircoCiangottini, DiegoSpiga, DanieleStorchi, LorianoBaioletti, MarcoPoggioni, ValentinaReinforcement Learning for Smart Caching at the CMS experimentComputing and ComputersIn the near future, High Energy Physics experiments’ storage and computing needs will go far above what can be achieved by only scaling current computing models or current infrastructures. Considering the LHC case, for 10 years a federated infrastructure (Worldwide LHC Computing Grid, WLCG) has been successfully developed. Nevertheless, the High Luminosity (HL-LHC) scenario is forcing the WLCG community to dig for innovative solutions. In this landscape, one of the initiatives is the exploitation of Data Lakes as a solution to improve the Data and Storage management. The current Data Lake model foresees data caching to play a central role as a technical solution to reduce the impact of latency and network load. Moreover, even higher efficiency can be achieved through a smart caching algorithm: this motivates the development of an AI-based approach to the caching problem. In this work, a Reinforcement Learning-based cache model (named QCACHE) is applied in the CMS experiment context. More specifically, we focused our attention on the optimization of both cache performances and cache management costs. The QCACHE system is based on two distinct Q-Learning (or Deep Q-Learning) agents seeking to find the best action to take given the current state. More explicitly, they try to learn a policy that maximizes the total reward (i.e. hit or miss occurring in a given time span). While the addition Agent is taking care of all the cache writing requests, clearly the eviction agent deals with the decision to keep or to delete files in the cache. We will present an overview of the QCACHE framework an the results in terms of cache performances, obtained using using “Real-world” data, will be compared respect to standard replacement policies (i.e. we used historical data requests aggregation used to predict dataset popularity filtered for Italian region). Moreover, we will show the planned subsequent evolution of the framework.oai:cds.cern.ch:28166652021
spellingShingle	Computing and Computers Tedeschi, Tommaso Tracolli, Mirco Ciangottini, Diego Spiga, Daniele Storchi, Loriano Baioletti, Marco Poggioni, Valentina Reinforcement Learning for Smart Caching at the CMS experiment
title	Reinforcement Learning for Smart Caching at the CMS experiment
title_full	Reinforcement Learning for Smart Caching at the CMS experiment
title_fullStr	Reinforcement Learning for Smart Caching at the CMS experiment
title_full_unstemmed	Reinforcement Learning for Smart Caching at the CMS experiment
title_short	Reinforcement Learning for Smart Caching at the CMS experiment
title_sort	reinforcement learning for smart caching at the cms experiment
topic	Computing and Computers
url	https://dx.doi.org/10.22323/1.378.0009 http://cds.cern.ch/record/2816665
work_keys_str_mv	AT tedeschitommaso reinforcementlearningforsmartcachingatthecmsexperiment AT tracollimirco reinforcementlearningforsmartcachingatthecmsexperiment AT ciangottinidiego reinforcementlearningforsmartcachingatthecmsexperiment AT spigadaniele reinforcementlearningforsmartcachingatthecmsexperiment AT storchiloriano reinforcementlearningforsmartcachingatthecmsexperiment AT baiolettimarco reinforcementlearningforsmartcachingatthecmsexperiment AT poggionivalentina reinforcementlearningforsmartcachingatthecmsexperiment

Reinforcement Learning for Smart Caching at the CMS experiment

Ejemplares similares