Cargando…

LHCb: Self managing experiment resources

Within this paper we present an autonomic Computing resources management system used by LHCb for assessing the status of their Grid resources. Virtual Organizations Grids include heterogeneous resources. For example, LHC experiments very often use resources not provided by WLCG and Cloud Computing r...

Descripción completa

Detalles Bibliográficos
Autores principales: Stagni, F, Ubeda Garcia, M
Lenguaje:eng
Publicado: 2013
Acceso en línea:http://cds.cern.ch/record/1610856
_version_ 1780932041580216320
author Stagni, F
Ubeda Garcia, M
author_facet Stagni, F
Ubeda Garcia, M
author_sort Stagni, F
collection CERN
description Within this paper we present an autonomic Computing resources management system used by LHCb for assessing the status of their Grid resources. Virtual Organizations Grids include heterogeneous resources. For example, LHC experiments very often use resources not provided by WLCG and Cloud Computing resources will soon provide a non-negligible fraction of their computing power. The lack of standards and procedures across experiments and sites generated the appearance of multiple information systems, monitoring tools, ticket portals, etc... which nowadays coexist and represent a very precious source of information for running HEP experiments Computing systems as well as sites. These two facts lead to many particular solutions for a general problem: managing the experiment resources. In this paper we present how LHCb, via the DIRAC interware addressed such issues. With a renewed Central Information Schema hosting all resources metadata and a Status System ( Resource Status System ) delivering real time information, the system controls the resources topology, independently of the resource types. The Resource Status System applies data mining techniques against all possible information sources available and assesses the status changes, that are then propagated to the topology description. Obviously, giving full control to such an automated system is not risk-free. Therefore, in order to minimise the probability of misbehavior, a battery of tests has been provided in order to certify the correctness of its assessments. We will demonstrate the performance and efficiency of such a system in terms of cost reduction and reliability.
id cern-1610856
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2013
record_format invenio
spelling cern-16108562019-09-30T06:29:59Zhttp://cds.cern.ch/record/1610856engStagni, FUbeda Garcia, MLHCb: Self managing experiment resourcesWithin this paper we present an autonomic Computing resources management system used by LHCb for assessing the status of their Grid resources. Virtual Organizations Grids include heterogeneous resources. For example, LHC experiments very often use resources not provided by WLCG and Cloud Computing resources will soon provide a non-negligible fraction of their computing power. The lack of standards and procedures across experiments and sites generated the appearance of multiple information systems, monitoring tools, ticket portals, etc... which nowadays coexist and represent a very precious source of information for running HEP experiments Computing systems as well as sites. These two facts lead to many particular solutions for a general problem: managing the experiment resources. In this paper we present how LHCb, via the DIRAC interware addressed such issues. With a renewed Central Information Schema hosting all resources metadata and a Status System ( Resource Status System ) delivering real time information, the system controls the resources topology, independently of the resource types. The Resource Status System applies data mining techniques against all possible information sources available and assesses the status changes, that are then propagated to the topology description. Obviously, giving full control to such an automated system is not risk-free. Therefore, in order to minimise the probability of misbehavior, a battery of tests has been provided in order to certify the correctness of its assessments. We will demonstrate the performance and efficiency of such a system in terms of cost reduction and reliability.Poster-2013-333oai:cds.cern.ch:16108562013-10-14
spellingShingle Stagni, F
Ubeda Garcia, M
LHCb: Self managing experiment resources
title LHCb: Self managing experiment resources
title_full LHCb: Self managing experiment resources
title_fullStr LHCb: Self managing experiment resources
title_full_unstemmed LHCb: Self managing experiment resources
title_short LHCb: Self managing experiment resources
title_sort lhcb: self managing experiment resources
url http://cds.cern.ch/record/1610856
work_keys_str_mv AT stagnif lhcbselfmanagingexperimentresources
AT ubedagarciam lhcbselfmanagingexperimentresources