Cargando…

Lessons Learnt from WLCG Service Deployment

This paper summarises the main lessons learnt from deploying WLCG production services, with a focus on Reliability, Scalability, Accountability, which lead to both manageability and usability. Each topic is analysed in turn. Techniques for zero-user-visible downtime for the main service intervention...

Descripción completa

Detalles Bibliográficos
Autor principal: Shiers, J D
Lenguaje:eng
Publicado: 2007
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/119/5/052030
http://cds.cern.ch/record/1056692
_version_ 1780913085823844352
author Shiers, J D
author_facet Shiers, J D
author_sort Shiers, J D
collection CERN
description This paper summarises the main lessons learnt from deploying WLCG production services, with a focus on Reliability, Scalability, Accountability, which lead to both manageability and usability. Each topic is analysed in turn. Techniques for zero-user-visible downtime for the main service interventions are described, together with pathological cases that need special treatment. The requirements in terms of scalability are analysed, calling for as much robustness and automation in the service as possible. The different aspects of accountability - which covers measuring / tracking / logging / monitoring what is going on -- and has gone on - is examined, with the goal of attaining a manageable service. Finally, a simple analogy is drawn with the Web in terms of usability - what do we need to achieve to cross the chasm from small-scale adoption to ubiquity?
id cern-1056692
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2007
record_format invenio
spelling cern-10566922022-08-17T13:37:34Zdoi:10.1088/1742-6596/119/5/052030http://cds.cern.ch/record/1056692engShiers, J DLessons Learnt from WLCG Service DeploymentComputing and ComputersThis paper summarises the main lessons learnt from deploying WLCG production services, with a focus on Reliability, Scalability, Accountability, which lead to both manageability and usability. Each topic is analysed in turn. Techniques for zero-user-visible downtime for the main service interventions are described, together with pathological cases that need special treatment. The requirements in terms of scalability are analysed, calling for as much robustness and automation in the service as possible. The different aspects of accountability - which covers measuring / tracking / logging / monitoring what is going on -- and has gone on - is examined, with the goal of attaining a manageable service. Finally, a simple analogy is drawn with the Web in terms of usability - what do we need to achieve to cross the chasm from small-scale adoption to ubiquity?CERN-IT-Note-2007-011oai:cds.cern.ch:10566922007-09-13
spellingShingle Computing and Computers
Shiers, J D
Lessons Learnt from WLCG Service Deployment
title Lessons Learnt from WLCG Service Deployment
title_full Lessons Learnt from WLCG Service Deployment
title_fullStr Lessons Learnt from WLCG Service Deployment
title_full_unstemmed Lessons Learnt from WLCG Service Deployment
title_short Lessons Learnt from WLCG Service Deployment
title_sort lessons learnt from wlcg service deployment
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/119/5/052030
http://cds.cern.ch/record/1056692
work_keys_str_mv AT shiersjd lessonslearntfromwlcgservicedeployment