Cargando…
ATLAS TDAQ System Administration: an overview and evolution
The ATLAS Trigger and Data Acquisition (TDAQ) system is responsible for the online processing of live data streaming from the ATLAS experiment at the Large Hadron Collider (LHC) at CERN. The system processes the direct data readout from ~100 million channels on the detector through three trigger lev...
Autores principales: | , , , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2013
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/1528600 |
_version_ | 1780929474174386176 |
---|---|
author | LEE, CJ BALLESTRERO, S BOGDANCHIKOV, A BRASOLIN, F CONTESCU, AC DARLEA, GL KOROL, A SCANNICCHIO, DA TWOMEY, M VALSAN, ML |
author_facet | LEE, CJ BALLESTRERO, S BOGDANCHIKOV, A BRASOLIN, F CONTESCU, AC DARLEA, GL KOROL, A SCANNICCHIO, DA TWOMEY, M VALSAN, ML |
author_sort | LEE, CJ |
collection | CERN |
description | The ATLAS Trigger and Data Acquisition (TDAQ) system is responsible for the online processing of live data streaming from the ATLAS experiment at the Large Hadron Collider (LHC) at CERN. The system processes the direct data readout from ~100 million channels on the detector through three trigger levels, selecting interesting events for analysis with a factor of 10^7 reduction on the data rate with a latency of less than a few seconds. Most of the functionality is implemented on ~3000 servers composing the online farm. Due to the critical functionality of the system a sophisticated computing environment is maintained, covering the online farm and ATLAS control rooms, as well as a number of development and testing labs. The specificity of the system required the development of dedicated applications (e.g. ConfDB, BWM) for system configuration and maintenance; in parallel other Open Source tools (Puppet and Quattor) are used to centrally configure the operating systems. The health monitoring of the TDAQ system hardware and OS performs ~60 thousand checks every 5 minutes; it is currently implemented over Nagios, and it is being complemented and replaced by Ganglia and Icinga. The online system adopted a sophisticated user management, based on the Active Directory infrastructure and integrated with Access Manager, a dedicated Role Based Access Control (RBAC) tool. The RBAC and its underlying LDAP database control user rights from the external access to the farm down to specific user actions. A web-based user interface allows delegated administrators to manage specific role assignments. The current activities of the SysAdmin group include the daily monitoring, troubleshooting and maintenance of the online system, storage and farm upgrades, and readying systems for an upgrade to Scientific Linux 6 with the related global integration, configuration, optimisation and hardware updates necessary. In addition, during the 2013 shutdown the team will provide support for the usage of a large fraction of the online farm for GEANT4 simulations of ATLAS. |
id | cern-1528600 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2013 |
record_format | invenio |
spelling | cern-15286002019-09-30T06:29:59Zhttp://cds.cern.ch/record/1528600engLEE, CJBALLESTRERO, SBOGDANCHIKOV, ABRASOLIN, FCONTESCU, ACDARLEA, GLKOROL, ASCANNICCHIO, DATWOMEY, MVALSAN, MLATLAS TDAQ System Administration: an overview and evolutionDetectors and Experimental TechniquesThe ATLAS Trigger and Data Acquisition (TDAQ) system is responsible for the online processing of live data streaming from the ATLAS experiment at the Large Hadron Collider (LHC) at CERN. The system processes the direct data readout from ~100 million channels on the detector through three trigger levels, selecting interesting events for analysis with a factor of 10^7 reduction on the data rate with a latency of less than a few seconds. Most of the functionality is implemented on ~3000 servers composing the online farm. Due to the critical functionality of the system a sophisticated computing environment is maintained, covering the online farm and ATLAS control rooms, as well as a number of development and testing labs. The specificity of the system required the development of dedicated applications (e.g. ConfDB, BWM) for system configuration and maintenance; in parallel other Open Source tools (Puppet and Quattor) are used to centrally configure the operating systems. The health monitoring of the TDAQ system hardware and OS performs ~60 thousand checks every 5 minutes; it is currently implemented over Nagios, and it is being complemented and replaced by Ganglia and Icinga. The online system adopted a sophisticated user management, based on the Active Directory infrastructure and integrated with Access Manager, a dedicated Role Based Access Control (RBAC) tool. The RBAC and its underlying LDAP database control user rights from the external access to the farm down to specific user actions. A web-based user interface allows delegated administrators to manage specific role assignments. The current activities of the SysAdmin group include the daily monitoring, troubleshooting and maintenance of the online system, storage and farm upgrades, and readying systems for an upgrade to Scientific Linux 6 with the related global integration, configuration, optimisation and hardware updates necessary. In addition, during the 2013 shutdown the team will provide support for the usage of a large fraction of the online farm for GEANT4 simulations of ATLAS.ATL-DAQ-SLIDE-2013-081oai:cds.cern.ch:15286002013-03-18 |
spellingShingle | Detectors and Experimental Techniques LEE, CJ BALLESTRERO, S BOGDANCHIKOV, A BRASOLIN, F CONTESCU, AC DARLEA, GL KOROL, A SCANNICCHIO, DA TWOMEY, M VALSAN, ML ATLAS TDAQ System Administration: an overview and evolution |
title | ATLAS TDAQ System Administration: an overview and evolution |
title_full | ATLAS TDAQ System Administration: an overview and evolution |
title_fullStr | ATLAS TDAQ System Administration: an overview and evolution |
title_full_unstemmed | ATLAS TDAQ System Administration: an overview and evolution |
title_short | ATLAS TDAQ System Administration: an overview and evolution |
title_sort | atlas tdaq system administration: an overview and evolution |
topic | Detectors and Experimental Techniques |
url | http://cds.cern.ch/record/1528600 |
work_keys_str_mv | AT leecj atlastdaqsystemadministrationanoverviewandevolution AT ballestreros atlastdaqsystemadministrationanoverviewandevolution AT bogdanchikova atlastdaqsystemadministrationanoverviewandevolution AT brasolinf atlastdaqsystemadministrationanoverviewandevolution AT contescuac atlastdaqsystemadministrationanoverviewandevolution AT darleagl atlastdaqsystemadministrationanoverviewandevolution AT korola atlastdaqsystemadministrationanoverviewandevolution AT scannicchioda atlastdaqsystemadministrationanoverviewandevolution AT twomeym atlastdaqsystemadministrationanoverviewandevolution AT valsanml atlastdaqsystemadministrationanoverviewandevolution |