Cargando…

Grid Reliability

Thanks to the Grid, users have access to computing resources distributed all over the world. The Grid hides the complexity and the differences of its heterogeneous components. In such a distributed system, it is clearly very important that errors are detected as soon as possible, and that the proced...

Descripción completa

Detalles Bibliográficos
Autores principales: Saiz, P, Andreeva, J, Cirstoiu, C, Gaidioz, B, Herrala, J, Maguire, E J, Maier, H, Rocha, R
Lenguaje:eng
Publicado: 2007
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/119/6/062042
http://cds.cern.ch/record/1069098
_version_ 1780913332576845824
author Saiz, P
Andreeva, J
Cirstoiu, C
Gaidioz, B
Herrala, J
Maguire, E J
Maier, H
Rocha, R
author_facet Saiz, P
Andreeva, J
Cirstoiu, C
Gaidioz, B
Herrala, J
Maguire, E J
Maier, H
Rocha, R
author_sort Saiz, P
collection CERN
description Thanks to the Grid, users have access to computing resources distributed all over the world. The Grid hides the complexity and the differences of its heterogeneous components. In such a distributed system, it is clearly very important that errors are detected as soon as possible, and that the procedure to solve them is well established. We focused on two of its main elements: the workload and the data management systems. We developed an application to investigate the efficiency of the different centres. Furthermore, our system can be used to categorize the most common error messages, and control their time evolution.
id cern-1069098
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2007
record_format invenio
spelling cern-10690982022-08-17T13:37:27Zdoi:10.1088/1742-6596/119/6/062042http://cds.cern.ch/record/1069098engSaiz, PAndreeva, JCirstoiu, CGaidioz, BHerrala, JMaguire, E JMaier, HRocha, RGrid ReliabilityComputing and ComputersThanks to the Grid, users have access to computing resources distributed all over the world. The Grid hides the complexity and the differences of its heterogeneous components. In such a distributed system, it is clearly very important that errors are detected as soon as possible, and that the procedure to solve them is well established. We focused on two of its main elements: the workload and the data management systems. We developed an application to investigate the efficiency of the different centres. Furthermore, our system can be used to categorize the most common error messages, and control their time evolution.CERN-IT-Note-2007-039oai:cds.cern.ch:10690982007-10-31
spellingShingle Computing and Computers
Saiz, P
Andreeva, J
Cirstoiu, C
Gaidioz, B
Herrala, J
Maguire, E J
Maier, H
Rocha, R
Grid Reliability
title Grid Reliability
title_full Grid Reliability
title_fullStr Grid Reliability
title_full_unstemmed Grid Reliability
title_short Grid Reliability
title_sort grid reliability
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/119/6/062042
http://cds.cern.ch/record/1069098
work_keys_str_mv AT saizp gridreliability
AT andreevaj gridreliability
AT cirstoiuc gridreliability
AT gaidiozb gridreliability
AT herralaj gridreliability
AT maguireej gridreliability
AT maierh gridreliability
AT rochar gridreliability