Cargando…
Integrated automation for configuration management and operations in the ATLAS online computing farm
The online farm of the ATLAS experiment at the LHC, consisting of nearly 4000 PCs with various characteristics, provides configuration and control of the detector and performs the collection, processing, selection, and conveyance of event data from the front-end electronics to mass storage. Differen...
Autores principales: | , , , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2018
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1051/epjconf/201921408022 http://cds.cern.ch/record/2649073 |
_version_ | 1780960712834678784 |
---|---|
author | Amirkhanov, Artem Ballestrero, Sergio Brasolin, Franco Lee, Christopher Jon Du Plessis, Haydn Dean Mitrogeorgos, Konstantinos Pernigotti, Marco Sanchez Pineda, Arturo Rodolfo Scannicchio, Diana Alessandra Twomey, Matthew Shaun |
author_facet | Amirkhanov, Artem Ballestrero, Sergio Brasolin, Franco Lee, Christopher Jon Du Plessis, Haydn Dean Mitrogeorgos, Konstantinos Pernigotti, Marco Sanchez Pineda, Arturo Rodolfo Scannicchio, Diana Alessandra Twomey, Matthew Shaun |
author_sort | Amirkhanov, Artem |
collection | CERN |
description | The online farm of the ATLAS experiment at the LHC, consisting of nearly 4000 PCs with various characteristics, provides configuration and control of the detector and performs the collection, processing, selection, and conveyance of event data from the front-end electronics to mass storage. Different aspects of the farm management are already accessible via several tools. The status and health of each node are monitored by a system based on Icinga 2 and Ganglia. PuppetDB gathers centrally all the status information from Puppet, the configuration management tool used to ensure configuration consistency of every node. The in-house Configuration Database (ConfDB) controls DHCP and PXE, while also integrating external information sources. In these proceedings we present our roadmap for integrating these and other data sources and systems, and building a higher level of abstraction on top of this foundation. An automation and orchestration tool will be able to use these systems and replace lengthy manual procedures, some of which also require interactions with other systems and teams, e.g. for the repair of a faulty node. Finally, an inventory and tracking system will complement the available data sources, keep track of node history, and improve the evaluation of long-term lifecycle management and purchase strategies. |
id | cern-2649073 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2018 |
record_format | invenio |
spelling | cern-26490732022-08-10T12:24:17Zdoi:10.1051/epjconf/201921408022http://cds.cern.ch/record/2649073engAmirkhanov, ArtemBallestrero, SergioBrasolin, FrancoLee, Christopher JonDu Plessis, Haydn DeanMitrogeorgos, KonstantinosPernigotti, MarcoSanchez Pineda, Arturo RodolfoScannicchio, Diana AlessandraTwomey, Matthew ShaunIntegrated automation for configuration management and operations in the ATLAS online computing farmParticle Physics - ExperimentThe online farm of the ATLAS experiment at the LHC, consisting of nearly 4000 PCs with various characteristics, provides configuration and control of the detector and performs the collection, processing, selection, and conveyance of event data from the front-end electronics to mass storage. Different aspects of the farm management are already accessible via several tools. The status and health of each node are monitored by a system based on Icinga 2 and Ganglia. PuppetDB gathers centrally all the status information from Puppet, the configuration management tool used to ensure configuration consistency of every node. The in-house Configuration Database (ConfDB) controls DHCP and PXE, while also integrating external information sources. In these proceedings we present our roadmap for integrating these and other data sources and systems, and building a higher level of abstraction on top of this foundation. An automation and orchestration tool will be able to use these systems and replace lengthy manual procedures, some of which also require interactions with other systems and teams, e.g. for the repair of a faulty node. Finally, an inventory and tracking system will complement the available data sources, keep track of node history, and improve the evaluation of long-term lifecycle management and purchase strategies.ATL-DAQ-PROC-2018-038oai:cds.cern.ch:26490732018-11-27 |
spellingShingle | Particle Physics - Experiment Amirkhanov, Artem Ballestrero, Sergio Brasolin, Franco Lee, Christopher Jon Du Plessis, Haydn Dean Mitrogeorgos, Konstantinos Pernigotti, Marco Sanchez Pineda, Arturo Rodolfo Scannicchio, Diana Alessandra Twomey, Matthew Shaun Integrated automation for configuration management and operations in the ATLAS online computing farm |
title | Integrated automation for configuration management and operations in the ATLAS online computing farm |
title_full | Integrated automation for configuration management and operations in the ATLAS online computing farm |
title_fullStr | Integrated automation for configuration management and operations in the ATLAS online computing farm |
title_full_unstemmed | Integrated automation for configuration management and operations in the ATLAS online computing farm |
title_short | Integrated automation for configuration management and operations in the ATLAS online computing farm |
title_sort | integrated automation for configuration management and operations in the atlas online computing farm |
topic | Particle Physics - Experiment |
url | https://dx.doi.org/10.1051/epjconf/201921408022 http://cds.cern.ch/record/2649073 |
work_keys_str_mv | AT amirkhanovartem integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm AT ballestrerosergio integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm AT brasolinfranco integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm AT leechristopherjon integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm AT duplessishaydndean integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm AT mitrogeorgoskonstantinos integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm AT pernigottimarco integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm AT sanchezpinedaarturorodolfo integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm AT scannicchiodianaalessandra integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm AT twomeymatthewshaun integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm |