Cargando…

Integrated automation for configuration management and operations in the ATLAS online computing farm

The online farm of the ATLAS experiment at the LHC, consisting of nearly 4000 PCs with various characteristics, provides configuration and control of the detector and performs the collection, processing, selection, and conveyance of event data from the front-end electronics to mass storage. Differen...

Descripción completa

Detalles Bibliográficos
Autores principales: Amirkhanov, Artem, Ballestrero, Sergio, Brasolin, Franco, Lee, Christopher Jon, Du Plessis, Haydn Dean, Mitrogeorgos, Konstantinos, Pernigotti, Marco, Sanchez Pineda, Arturo Rodolfo, Scannicchio, Diana Alessandra, Twomey, Matthew Shaun
Lenguaje:eng
Publicado: 2018
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/201921408022
http://cds.cern.ch/record/2649073
_version_ 1780960712834678784
author Amirkhanov, Artem
Ballestrero, Sergio
Brasolin, Franco
Lee, Christopher Jon
Du Plessis, Haydn Dean
Mitrogeorgos, Konstantinos
Pernigotti, Marco
Sanchez Pineda, Arturo Rodolfo
Scannicchio, Diana Alessandra
Twomey, Matthew Shaun
author_facet Amirkhanov, Artem
Ballestrero, Sergio
Brasolin, Franco
Lee, Christopher Jon
Du Plessis, Haydn Dean
Mitrogeorgos, Konstantinos
Pernigotti, Marco
Sanchez Pineda, Arturo Rodolfo
Scannicchio, Diana Alessandra
Twomey, Matthew Shaun
author_sort Amirkhanov, Artem
collection CERN
description The online farm of the ATLAS experiment at the LHC, consisting of nearly 4000 PCs with various characteristics, provides configuration and control of the detector and performs the collection, processing, selection, and conveyance of event data from the front-end electronics to mass storage. Different aspects of the farm management are already accessible via several tools. The status and health of each node are monitored by a system based on Icinga 2 and Ganglia. PuppetDB gathers centrally all the status information from Puppet, the configuration management tool used to ensure configuration consistency of every node. The in-house Configuration Database (ConfDB) controls DHCP and PXE, while also integrating external information sources. In these proceedings we present our roadmap for integrating these and other data sources and systems, and building a higher level of abstraction on top of this foundation. An automation and orchestration tool will be able to use these systems and replace lengthy manual procedures, some of which also require interactions with other systems and teams, e.g. for the repair of a faulty node. Finally, an inventory and tracking system will complement the available data sources, keep track of node history, and improve the evaluation of long-term lifecycle management and purchase strategies.
id cern-2649073
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2018
record_format invenio
spelling cern-26490732022-08-10T12:24:17Zdoi:10.1051/epjconf/201921408022http://cds.cern.ch/record/2649073engAmirkhanov, ArtemBallestrero, SergioBrasolin, FrancoLee, Christopher JonDu Plessis, Haydn DeanMitrogeorgos, KonstantinosPernigotti, MarcoSanchez Pineda, Arturo RodolfoScannicchio, Diana AlessandraTwomey, Matthew ShaunIntegrated automation for configuration management and operations in the ATLAS online computing farmParticle Physics - ExperimentThe online farm of the ATLAS experiment at the LHC, consisting of nearly 4000 PCs with various characteristics, provides configuration and control of the detector and performs the collection, processing, selection, and conveyance of event data from the front-end electronics to mass storage. Different aspects of the farm management are already accessible via several tools. The status and health of each node are monitored by a system based on Icinga 2 and Ganglia. PuppetDB gathers centrally all the status information from Puppet, the configuration management tool used to ensure configuration consistency of every node. The in-house Configuration Database (ConfDB) controls DHCP and PXE, while also integrating external information sources. In these proceedings we present our roadmap for integrating these and other data sources and systems, and building a higher level of abstraction on top of this foundation. An automation and orchestration tool will be able to use these systems and replace lengthy manual procedures, some of which also require interactions with other systems and teams, e.g. for the repair of a faulty node. Finally, an inventory and tracking system will complement the available data sources, keep track of node history, and improve the evaluation of long-term lifecycle management and purchase strategies.ATL-DAQ-PROC-2018-038oai:cds.cern.ch:26490732018-11-27
spellingShingle Particle Physics - Experiment
Amirkhanov, Artem
Ballestrero, Sergio
Brasolin, Franco
Lee, Christopher Jon
Du Plessis, Haydn Dean
Mitrogeorgos, Konstantinos
Pernigotti, Marco
Sanchez Pineda, Arturo Rodolfo
Scannicchio, Diana Alessandra
Twomey, Matthew Shaun
Integrated automation for configuration management and operations in the ATLAS online computing farm
title Integrated automation for configuration management and operations in the ATLAS online computing farm
title_full Integrated automation for configuration management and operations in the ATLAS online computing farm
title_fullStr Integrated automation for configuration management and operations in the ATLAS online computing farm
title_full_unstemmed Integrated automation for configuration management and operations in the ATLAS online computing farm
title_short Integrated automation for configuration management and operations in the ATLAS online computing farm
title_sort integrated automation for configuration management and operations in the atlas online computing farm
topic Particle Physics - Experiment
url https://dx.doi.org/10.1051/epjconf/201921408022
http://cds.cern.ch/record/2649073
work_keys_str_mv AT amirkhanovartem integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm
AT ballestrerosergio integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm
AT brasolinfranco integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm
AT leechristopherjon integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm
AT duplessishaydndean integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm
AT mitrogeorgoskonstantinos integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm
AT pernigottimarco integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm
AT sanchezpinedaarturorodolfo integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm
AT scannicchiodianaalessandra integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm
AT twomeymatthewshaun integratedautomationforconfigurationmanagementandoperationsintheatlasonlinecomputingfarm