LHCb: LHCb Distributed Computing Operations

Bibliographic Details
Main authors: Stagni, F; Santinelli, R
Language: eng
Published: 2011
Online access: http://cds.cern.ch/record/1379877
author Stagni, F
Santinelli, R
collection CERN
description The proliferation of tools for monitoring both activities and infrastructure, together with the pressing need for prompt reaction to problems affecting data taking, data reconstruction, data reprocessing and user analysis, has created a need to better organize the huge amount of information available. The monitoring system for LHCb Grid computing relies on many heterogeneous and independent sources of information that offer different views for a better understanding of problems, while an operations team and defined procedures have been put in place to handle them. This work summarizes the state of the art of LHCb Grid operations, emphasizing the reasons behind the various choices and the tools currently in use to run our daily activities. We highlight the most common problems experienced over years of activity on the WLCG infrastructure, the services and their criticality, the procedures in place, the relevant metrics, the tools available and the ones still missing.
id cern-1379877
institution European Organization for Nuclear Research
language eng
publishDate 2011
record_format invenio
spelling cern-1379877; 2019-09-30T06:29:59Z; http://cds.cern.ch/record/1379877; eng; Stagni, F; Santinelli, R; LHCb: LHCb Distributed Computing Operations; Poster-2011-194; oai:cds.cern.ch:1379877; 2011-09-05
title LHCb: LHCb Distributed Computing Operations
url http://cds.cern.ch/record/1379877