Automating ATLAS Computing Operations using the Site Status Board

The automation of operations is essential to reduce manpower costs and improve the reliability of the system. The Site Status Board (SSB) is a framework which allows Virtual Organizations to monitor their computing activities at distributed sites and to evaluate site performance. The ATLAS experimen...

Bibliographic Details
Main authors: Andreeva, J; Borrego Iglesias, C; Campana, S; Di Girolamo, A; Espinal Curull, X; Gayazov, S; Magradze, E; Nowotka, MM; Rinaldi, L; Saiz, P; Schovancova, J; Stewart, GA; Wright, M
Language: eng
Published: 2012
Subjects: Detectors and Experimental Techniques
Online access: http://cds.cern.ch/record/1448173
author Andreeva, J
Borrego Iglesias, C
Campana, S
Di Girolamo, A
Espinal Curull, X
Gayazov, S
Magradze, E
Nowotka, MM
Rinaldi, L
Saiz, P
Schovancova, J
Stewart, GA
Wright, M
collection CERN
description The automation of operations is essential to reduce manpower costs and improve the reliability of the system. The Site Status Board (SSB) is a framework that allows Virtual Organizations to monitor their computing activities at distributed sites and to evaluate site performance. The ATLAS experiment uses SSB intensively for distributed computing shifts, for estimating data processing and data transfer efficiencies at a particular site, and for automatically excluding sites from computing activities when potential problems arise. ATLAS SSB provides a real-time aggregated monitoring view and keeps the history of the monitoring metrics. From this history, the usability of a site from the ATLAS perspective is calculated. The presentation will describe how SSB is integrated into the ATLAS operations and computing infrastructure and will cover implementation details of the ATLAS SSB sensors and the alarm system based on the information in SSB. It will demonstrate the positive impact of SSB on the overall performance of ATLAS computing activities and will give an overview of future plans.
id cern-1448173
institution European Organization for Nuclear Research (CERN)
language eng
publishDate 2012
record_format invenio
spelling cern-1448173 2019-09-30T06:29:59Z http://cds.cern.ch/record/1448173 eng ATL-SOFT-SLIDE-2012-201 oai:cds.cern.ch:1448173 2012-05-12
title Automating ATLAS Computing Operations using the Site Status Board
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1448173