Cargando…

Aggregated monitoring and automatic site exclusion of the ATLAS computing activities: the ATLAS Site Status Board

In the context of the Large Hadron Collider (LHC) at the European Organization for Nuclear Research (CERN), ATLAS (A Toroidal LHC Apparatus) is one of the six particle detectors constructed at the accelerator. ATLAS experiment generates large amounts of raw data which are analysed on different activ...

Descripción completa

Detalles Bibliográficos
Autores principales: Borrego, C, Di Girolamo, A, Espinal, X, Rinaldi, L, Schovancova, J, Andreeva, J, Nowotka, M M, Saiz, P
Lenguaje:eng
Publicado: 2011
Materias:
Acceso en línea:http://cds.cern.ch/record/1341856
Descripción
Sumario:In the context of the Large Hadron Collider (LHC) at the European Organization for Nuclear Research (CERN), ATLAS (A Toroidal LHC Apparatus) is one of the six particle detectors constructed at the accelerator. ATLAS experiment generates large amounts of raw data which are analysed on different activities by tens of sites all around the world. There are many different monitoring tools spread around the different sites to check the status of the different activities. The ATLAS Site Status Board (SSB) is a framework to monitor the overall status of the ATLAS distributed computing activities in the sites. From another hand, with this monitoring information we have created an infrastructure to automatically exclude and re-include sites in the different activities on the basis of dynamic policy. In this paper we present the infrastructure architecture, implementation details and lessons learned.