Cargando…
Application of reinforcement learning in the LHC tune feedback
The Beam-Based Feedback System (BBFS) was primarily responsible for correcting the beam energy, orbit and tune in the CERN Large Hadron Collider (LHC). A major code renovation of the BBFS was planned and carried out during the LHC Long Shutdown 2 (LS2). This work consists of an explorative study to...
Autores principales: | , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2022
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.3389/fphy.2022.929064 http://cds.cern.ch/record/2836033 |
_version_ | 1780975718603161600 |
---|---|
author | Grech, Leander Valentino, Gianluca Alves, Diogo Hirlaender, Simon |
author_facet | Grech, Leander Valentino, Gianluca Alves, Diogo Hirlaender, Simon |
author_sort | Grech, Leander |
collection | CERN |
description | The Beam-Based Feedback System (BBFS) was primarily responsible for correcting the beam energy, orbit and tune in the CERN Large Hadron Collider (LHC). A major code renovation of the BBFS was planned and carried out during the LHC Long Shutdown 2 (LS2). This work consists of an explorative study to solve a beam-based control problem, the tune feedback (QFB), utilising state-of-the-art Reinforcement Learning (RL). A simulation environment was created to mimic the operation of the QFB. A series of RL agents were trained, and the best-performing agents were 9 then subjected to a set of well-designed tests. The original feedback controller used in the QFB was reimplemented to compare the performance of the classical approach to the performance of selected RL agents in the test scenarios. Results from the simulated environment show that the RL agent performance can exceed the controller-based paradigm. |
id | cern-2836033 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2022 |
record_format | invenio |
spelling | cern-28360332022-10-07T21:54:35Zdoi:10.3389/fphy.2022.929064http://cds.cern.ch/record/2836033engGrech, LeanderValentino, GianlucaAlves, DiogoHirlaender, SimonApplication of reinforcement learning in the LHC tune feedbackAccelerators and Storage RingsThe Beam-Based Feedback System (BBFS) was primarily responsible for correcting the beam energy, orbit and tune in the CERN Large Hadron Collider (LHC). A major code renovation of the BBFS was planned and carried out during the LHC Long Shutdown 2 (LS2). This work consists of an explorative study to solve a beam-based control problem, the tune feedback (QFB), utilising state-of-the-art Reinforcement Learning (RL). A simulation environment was created to mimic the operation of the QFB. A series of RL agents were trained, and the best-performing agents were 9 then subjected to a set of well-designed tests. The original feedback controller used in the QFB was reimplemented to compare the performance of the classical approach to the performance of selected RL agents in the test scenarios. Results from the simulated environment show that the RL agent performance can exceed the controller-based paradigm.oai:cds.cern.ch:28360332022 |
spellingShingle | Accelerators and Storage Rings Grech, Leander Valentino, Gianluca Alves, Diogo Hirlaender, Simon Application of reinforcement learning in the LHC tune feedback |
title | Application of reinforcement learning in the LHC tune feedback |
title_full | Application of reinforcement learning in the LHC tune feedback |
title_fullStr | Application of reinforcement learning in the LHC tune feedback |
title_full_unstemmed | Application of reinforcement learning in the LHC tune feedback |
title_short | Application of reinforcement learning in the LHC tune feedback |
title_sort | application of reinforcement learning in the lhc tune feedback |
topic | Accelerators and Storage Rings |
url | https://dx.doi.org/10.3389/fphy.2022.929064 http://cds.cern.ch/record/2836033 |
work_keys_str_mv | AT grechleander applicationofreinforcementlearninginthelhctunefeedback AT valentinogianluca applicationofreinforcementlearninginthelhctunefeedback AT alvesdiogo applicationofreinforcementlearninginthelhctunefeedback AT hirlaendersimon applicationofreinforcementlearninginthelhctunefeedback |