
ATLAS TDAQ System Administration: an overview and evolution

The ATLAS Trigger and Data Acquisition (TDAQ) system is responsible for the online processing of live data streaming from the ATLAS experiment at the Large Hadron Collider (LHC) at CERN. The system processes the direct data readout from ~100 million channels on the detector through three trigger lev...


Bibliographic Details
Main authors: LEE, CJ, BALLESTRERO, S, BOGDANCHIKOV, A, BRASOLIN, F, CONTESCU, AC, DARLEA, GL, KOROL, A, SCANNICCHIO, DA, TWOMEY, M, VALSAN, ML
Language: eng
Published: 2013
Subjects: Detectors and Experimental Techniques
Online access: http://cds.cern.ch/record/1528600
author LEE, CJ
BALLESTRERO, S
BOGDANCHIKOV, A
BRASOLIN, F
CONTESCU, AC
DARLEA, GL
KOROL, A
SCANNICCHIO, DA
TWOMEY, M
VALSAN, ML
author_sort LEE, CJ
collection CERN
description The ATLAS Trigger and Data Acquisition (TDAQ) system is responsible for the online processing of live data streaming from the ATLAS experiment at the Large Hadron Collider (LHC) at CERN. The system processes the direct data readout from ~100 million channels on the detector through three trigger levels, selecting interesting events for analysis with a factor of 10^7 reduction on the data rate with a latency of less than a few seconds. Most of the functionality is implemented on ~3000 servers composing the online farm. Due to the critical functionality of the system a sophisticated computing environment is maintained, covering the online farm and ATLAS control rooms, as well as a number of development and testing labs. The specificity of the system required the development of dedicated applications (e.g. ConfDB, BWM) for system configuration and maintenance; in parallel other Open Source tools (Puppet and Quattor) are used to centrally configure the operating systems. The health monitoring of the TDAQ system hardware and OS performs ~60 thousand checks every 5 minutes; it is currently implemented over Nagios, and it is being complemented and replaced by Ganglia and Icinga. The online system adopted a sophisticated user management, based on the Active Directory infrastructure and integrated with Access Manager, a dedicated Role Based Access Control (RBAC) tool. The RBAC and its underlying LDAP database control user rights from the external access to the farm down to specific user actions. A web-based user interface allows delegated administrators to manage specific role assignments. The current activities of the SysAdmin group include the daily monitoring, troubleshooting and maintenance of the online system, storage and farm upgrades, and readying systems for an upgrade to Scientific Linux 6 with the related global integration, configuration, optimisation and hardware updates necessary. In addition, during the 2013 shutdown the team will provide support for the usage of a large fraction of the online farm for GEANT4 simulations of ATLAS.
id cern-1528600
institution European Organization for Nuclear Research
language eng
publishDate 2013
record_format invenio
spelling cern-1528600; 2019-09-30T06:29:59Z; http://cds.cern.ch/record/1528600; eng; LEE, CJ; BALLESTRERO, S; BOGDANCHIKOV, A; BRASOLIN, F; CONTESCU, AC; DARLEA, GL; KOROL, A; SCANNICCHIO, DA; TWOMEY, M; VALSAN, ML; ATLAS TDAQ System Administration: an overview and evolution; Detectors and Experimental Techniques; ATL-DAQ-SLIDE-2013-081; oai:cds.cern.ch:1528600; 2013-03-18
title ATLAS TDAQ System Administration: an overview and evolution
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1528600