Cargando…

Robust and Resilient Services – How to design, build and operate them

Grid infrastructures require a high degree of fault tolerance and reliability. This can only be achieved by careful planning and detailed implementation. We describe on-going work within the WLCG project to build and run highly reliable services. Following the "a priori" analysis based on...

Descripción completa

Detalles Bibliográficos
Autores principales: Shiers, J, Méndez-Lorenzo, P, McCance, G
Lenguaje:eng
Publicado: 2007
Materias:
Acceso en línea:http://cds.cern.ch/record/1069476
_version_ 1780913346744156160
author Shiers, J
Méndez-Lorenzo, P
McCance, G
author_facet Shiers, J
Méndez-Lorenzo, P
McCance, G
author_sort Shiers, J
collection CERN
description Grid infrastructures require a high degree of fault tolerance and reliability. This can only be achieved by careful planning and detailed implementation. We describe on-going work within the WLCG project to build and run highly reliable services. Following the "a priori" analysis based on the services and service levels listed in the Memorandum of Understanding that sites participating in WLCG have signed[1], this paper provides an "a posteriori" analysis following over 2 years of production service. This work covers not only the services deployed at the Tier0 centre at CERN - which has the most stringent service requirements related to the acquisition of the raw data, the initial processing phase and the distribution of raw and processed data to Tier1 sites, but also a similar analysis for Tier1 and major Tier2 sites. The latter will be covered at a workshop that will take place shortly before the EELA conference and so will be very up-to-date.
id cern-1069476
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2007
record_format invenio
spelling cern-10694762019-09-30T06:29:59Zhttp://cds.cern.ch/record/1069476engShiers, JMéndez-Lorenzo, PMcCance, GRobust and Resilient Services – How to design, build and operate themComputing and ComputersGrid infrastructures require a high degree of fault tolerance and reliability. This can only be achieved by careful planning and detailed implementation. We describe on-going work within the WLCG project to build and run highly reliable services. Following the "a priori" analysis based on the services and service levels listed in the Memorandum of Understanding that sites participating in WLCG have signed[1], this paper provides an "a posteriori" analysis following over 2 years of production service. This work covers not only the services deployed at the Tier0 centre at CERN - which has the most stringent service requirements related to the acquisition of the raw data, the initial processing phase and the distribution of raw and processed data to Tier1 sites, but also a similar analysis for Tier1 and major Tier2 sites. The latter will be covered at a workshop that will take place shortly before the EELA conference and so will be very up-to-date.CERN-IT-Note-2007-044oai:cds.cern.ch:10694762007-11-16
spellingShingle Computing and Computers
Shiers, J
Méndez-Lorenzo, P
McCance, G
Robust and Resilient Services – How to design, build and operate them
title Robust and Resilient Services – How to design, build and operate them
title_full Robust and Resilient Services – How to design, build and operate them
title_fullStr Robust and Resilient Services – How to design, build and operate them
title_full_unstemmed Robust and Resilient Services – How to design, build and operate them
title_short Robust and Resilient Services – How to design, build and operate them
title_sort robust and resilient services – how to design, build and operate them
topic Computing and Computers
url http://cds.cern.ch/record/1069476
work_keys_str_mv AT shiersj robustandresilientservicesahowtodesignbuildandoperatethem
AT mendezlorenzop robustandresilientservicesahowtodesignbuildandoperatethem
AT mccanceg robustandresilientservicesahowtodesignbuildandoperatethem