Cargando…

Implementing High Availability with Cots Components and Open-Source Software

High Availability of IT services is essential for the successful operation of large experimental facilities such as the LHC experiments. In the past, high availability was often taken for granted and/or ensured by using very expensive high-end hardware based on proprietary, single-vendor solutions....

Descripción completa

Detalles Bibliográficos
Autores principales: Schwemmer, R, Neufeld, Niko
Lenguaje:eng
Publicado: 2009
Materias:
Acceso en línea:http://cds.cern.ch/record/1215283
_version_ 1780918091075551232
author Schwemmer, R
Neufeld, Niko
author_facet Schwemmer, R
Neufeld, Niko
author_sort Schwemmer, R
collection CERN
description High Availability of IT services is essential for the successful operation of large experimental facilities such as the LHC experiments. In the past, high availability was often taken for granted and/or ensured by using very expensive high-end hardware based on proprietary, single-vendor solutions. Today's IT infrastructure in HEP is usually a heterogeneous environment of cheap, off the shelf components which usually have no intrinsic failure tolerance and can thus not be considered reliable at all. Many services, in particular networked services like the Domain Name Service, shared storage and databases need to run on this unreliable hardware, while they are indispensable for the operation of today's control systems. We present our approach to this problem which is based on a combination of open-source tools, such as the Linux High Availability Projet and home-made tools to ensure high-availability for the LHCb Experiment Control system, which consists of over 200 servers, several hundred switches and is controlling thousands of devices ranging from custom made devices, connected to the LAN, to the servers of the event-filter farm.
id cern-1215283
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2009
record_format invenio
spelling cern-12152832019-09-30T06:29:59Zhttp://cds.cern.ch/record/1215283engSchwemmer, RNeufeld, NikoImplementing High Availability with Cots Components and Open-Source SoftwareComputing and ComputersHigh Availability of IT services is essential for the successful operation of large experimental facilities such as the LHC experiments. In the past, high availability was often taken for granted and/or ensured by using very expensive high-end hardware based on proprietary, single-vendor solutions. Today's IT infrastructure in HEP is usually a heterogeneous environment of cheap, off the shelf components which usually have no intrinsic failure tolerance and can thus not be considered reliable at all. Many services, in particular networked services like the Domain Name Service, shared storage and databases need to run on this unreliable hardware, while they are indispensable for the operation of today's control systems. We present our approach to this problem which is based on a combination of open-source tools, such as the Linux High Availability Projet and home-made tools to ensure high-availability for the LHCb Experiment Control system, which consists of over 200 servers, several hundred switches and is controlling thousands of devices ranging from custom made devices, connected to the LAN, to the servers of the event-filter farm.LHCb-PROC-2009-046LHCb-CONF-2009-046CERN-LHCb-CONF-2009-046oai:cds.cern.ch:12152832009-10-28
spellingShingle Computing and Computers
Schwemmer, R
Neufeld, Niko
Implementing High Availability with Cots Components and Open-Source Software
title Implementing High Availability with Cots Components and Open-Source Software
title_full Implementing High Availability with Cots Components and Open-Source Software
title_fullStr Implementing High Availability with Cots Components and Open-Source Software
title_full_unstemmed Implementing High Availability with Cots Components and Open-Source Software
title_short Implementing High Availability with Cots Components and Open-Source Software
title_sort implementing high availability with cots components and open-source software
topic Computing and Computers
url http://cds.cern.ch/record/1215283
work_keys_str_mv AT schwemmerr implementinghighavailabilitywithcotscomponentsandopensourcesoftware
AT neufeldniko implementinghighavailabilitywithcotscomponentsandopensourcesoftware