Cargando…
Grid Reliability
Thanks to the Grid, users have access to computing resources distributed all over the world. The Grid hides the complexity and the differences of its heterogeneous components. In such a distributed system, it is clearly very important that errors are detected as soon as possible, and that the proced...
Autores principales: | , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2007
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/119/6/062042 http://cds.cern.ch/record/1069098 |
_version_ | 1780913332576845824 |
---|---|
author | Saiz, P Andreeva, J Cirstoiu, C Gaidioz, B Herrala, J Maguire, E J Maier, H Rocha, R |
author_facet | Saiz, P Andreeva, J Cirstoiu, C Gaidioz, B Herrala, J Maguire, E J Maier, H Rocha, R |
author_sort | Saiz, P |
collection | CERN |
description | Thanks to the Grid, users have access to computing resources distributed all over the world. The Grid hides the complexity and the differences of its heterogeneous components. In such a distributed system, it is clearly very important that errors are detected as soon as possible, and that the procedure to solve them is well established. We focused on two of its main elements: the workload and the data management systems. We developed an application to investigate the efficiency of the different centres. Furthermore, our system can be used to categorize the most common error messages, and control their time evolution. |
id | cern-1069098 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2007 |
record_format | invenio |
spelling | cern-10690982022-08-17T13:37:27Zdoi:10.1088/1742-6596/119/6/062042http://cds.cern.ch/record/1069098engSaiz, PAndreeva, JCirstoiu, CGaidioz, BHerrala, JMaguire, E JMaier, HRocha, RGrid ReliabilityComputing and ComputersThanks to the Grid, users have access to computing resources distributed all over the world. The Grid hides the complexity and the differences of its heterogeneous components. In such a distributed system, it is clearly very important that errors are detected as soon as possible, and that the procedure to solve them is well established. We focused on two of its main elements: the workload and the data management systems. We developed an application to investigate the efficiency of the different centres. Furthermore, our system can be used to categorize the most common error messages, and control their time evolution.CERN-IT-Note-2007-039oai:cds.cern.ch:10690982007-10-31 |
spellingShingle | Computing and Computers Saiz, P Andreeva, J Cirstoiu, C Gaidioz, B Herrala, J Maguire, E J Maier, H Rocha, R Grid Reliability |
title | Grid Reliability |
title_full | Grid Reliability |
title_fullStr | Grid Reliability |
title_full_unstemmed | Grid Reliability |
title_short | Grid Reliability |
title_sort | grid reliability |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/119/6/062042 http://cds.cern.ch/record/1069098 |
work_keys_str_mv | AT saizp gridreliability AT andreevaj gridreliability AT cirstoiuc gridreliability AT gaidiozb gridreliability AT herralaj gridreliability AT maguireej gridreliability AT maierh gridreliability AT rochar gridreliability |