Cargando…

A Lightweight Monitoring and Accounting System for LHCb DC'04 Production

The LHCb Data Challenge 04 includes the simulation of over 200 M simulated events using distributed computing resources on N sites and extending along 3 months. To achieve this goal a dedicated Production grid (DIRAC) has been deployed. We will present the Job Monitoring and Accounting services deve...

Descripción completa

Detalles Bibliográficos
Autores principales: Garonne, V, Graciani Días, R, Saborido Silva, J J, Sánchez García, M, Vizcaya Carrillo, R
Lenguaje:eng
Publicado: 2004
Materias:
Acceso en línea:http://cds.cern.ch/record/1490632
Descripción
Sumario:The LHCb Data Challenge 04 includes the simulation of over 200 M simulated events using distributed computing resources on N sites and extending along 3 months. To achieve this goal a dedicated Production grid (DIRAC) has been deployed. We will present the Job Monitoring and Accounting services developed to follow the status of the production along its way and to evaluate the results at the end of the Data Challenge. The end user connects with a web browser to WEB-SERVER applications showing dynamic reports for a whole set of possible queries. These applications in turn interrogate the Job Monitoring Service of the DIRAC Workload Management system and Accounting Database service by means of dedicated XML-RPC interfaces, querying for the information requested by the user. The reports provide an uniform view of the usage of the computing resources available. All the system components are implemented as a set of cooperating python classes following the design choice of LHCb. The different services are distributed over a number of independent machines. This allows to achieve the scalability level of multiple thousands of concurrent jobs monitored by the system