Cargando…

APPLICATIONS OF EXPERT SYSTEM TECHNOLOGY IN THE ATLAS TDAQ CONTROLS FRAMEWORK

The ATLAS Trigger-DAQ system is composed of O(10000) of applications running ~1500 computers distributed over a network. To maximize the experiment run efficiency, the Trigger-DAQ control system includes advanced verification, diagnostics and complex dynamic error recovery tools, based on an expert...

Descripción completa

Detalles Bibliográficos
Autores principales: Kazarov, A, Corso-Radu, A, Lehmann, G, Magnoni, L, Murillo Garcia, R, Sloper, JE
Lenguaje:eng
Publicado: 2010
Materias:
Acceso en línea:http://cds.cern.ch/record/1279368
_version_ 1780920428294832128
author Kazarov, A
Corso-Radu, A
Lehmann, G
Magnoni, L
Murillo Garcia, R
Sloper, JE
author_facet Kazarov, A
Corso-Radu, A
Lehmann, G
Magnoni, L
Murillo Garcia, R
Sloper, JE
author_sort Kazarov, A
collection CERN
description The ATLAS Trigger-DAQ system is composed of O(10000) of applications running ~1500 computers distributed over a network. To maximize the experiment run efficiency, the Trigger-DAQ control system includes advanced verification, diagnostics and complex dynamic error recovery tools, based on an expert system. The error recovery (ER) system is responsible for analyzing and recovering from a variety of errors, both software and hardware, without stopping the data-gathering operations. The verification framework allows users to develop and configure tests for any component in the system with different levels of complexity. It can be used as a standalone test facility during the general TDAQ initialization procedure, and for diagnosing the problems which may occur at run time. A key role in both recovery and verification frameworks is played by the rule-based expert system, which is also known as a knowledge-based system, to analyze errors and decide on appropriate recovery actions. The system is composed of a dynamic set of rules that describe the TDAQ system behavior and by an inference engine that takes decisions on which actions to perform. The system is currently used on a daily basis for the operation of the ATLAS experiment. The paper describes the architecture and implementation of the TDAQ error-recovery system and verification framework with emphasis on the latest developments and experience gained over the first LHC beam ru ns.
id cern-1279368
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2010
record_format invenio
spelling cern-12793682019-09-30T06:29:59Zhttp://cds.cern.ch/record/1279368engKazarov, ACorso-Radu, ALehmann, GMagnoni, LMurillo Garcia, RSloper, JEAPPLICATIONS OF EXPERT SYSTEM TECHNOLOGY IN THE ATLAS TDAQ CONTROLS FRAMEWORKDetectors and Experimental TechniquesThe ATLAS Trigger-DAQ system is composed of O(10000) of applications running ~1500 computers distributed over a network. To maximize the experiment run efficiency, the Trigger-DAQ control system includes advanced verification, diagnostics and complex dynamic error recovery tools, based on an expert system. The error recovery (ER) system is responsible for analyzing and recovering from a variety of errors, both software and hardware, without stopping the data-gathering operations. The verification framework allows users to develop and configure tests for any component in the system with different levels of complexity. It can be used as a standalone test facility during the general TDAQ initialization procedure, and for diagnosing the problems which may occur at run time. A key role in both recovery and verification frameworks is played by the rule-based expert system, which is also known as a knowledge-based system, to analyze errors and decide on appropriate recovery actions. The system is composed of a dynamic set of rules that describe the TDAQ system behavior and by an inference engine that takes decisions on which actions to perform. The system is currently used on a daily basis for the operation of the ATLAS experiment. The paper describes the architecture and implementation of the TDAQ error-recovery system and verification framework with emphasis on the latest developments and experience gained over the first LHC beam ru ns.ATL-DAQ-SLIDE-2010-202oai:cds.cern.ch:12793682010-07-21
spellingShingle Detectors and Experimental Techniques
Kazarov, A
Corso-Radu, A
Lehmann, G
Magnoni, L
Murillo Garcia, R
Sloper, JE
APPLICATIONS OF EXPERT SYSTEM TECHNOLOGY IN THE ATLAS TDAQ CONTROLS FRAMEWORK
title APPLICATIONS OF EXPERT SYSTEM TECHNOLOGY IN THE ATLAS TDAQ CONTROLS FRAMEWORK
title_full APPLICATIONS OF EXPERT SYSTEM TECHNOLOGY IN THE ATLAS TDAQ CONTROLS FRAMEWORK
title_fullStr APPLICATIONS OF EXPERT SYSTEM TECHNOLOGY IN THE ATLAS TDAQ CONTROLS FRAMEWORK
title_full_unstemmed APPLICATIONS OF EXPERT SYSTEM TECHNOLOGY IN THE ATLAS TDAQ CONTROLS FRAMEWORK
title_short APPLICATIONS OF EXPERT SYSTEM TECHNOLOGY IN THE ATLAS TDAQ CONTROLS FRAMEWORK
title_sort applications of expert system technology in the atlas tdaq controls framework
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1279368
work_keys_str_mv AT kazarova applicationsofexpertsystemtechnologyintheatlastdaqcontrolsframework
AT corsoradua applicationsofexpertsystemtechnologyintheatlastdaqcontrolsframework
AT lehmanng applicationsofexpertsystemtechnologyintheatlastdaqcontrolsframework
AT magnonil applicationsofexpertsystemtechnologyintheatlastdaqcontrolsframework
AT murillogarciar applicationsofexpertsystemtechnologyintheatlastdaqcontrolsframework
AT sloperje applicationsofexpertsystemtechnologyintheatlastdaqcontrolsframework