CMS experience of running glideinWMS in High Availability mode

The CMS experiment at the Large Hadron Collider relies on the HTCondor-based glideinWMS batch system to handle most of its distributed computing needs. To minimize the risk of disruptions due to software and hardware problems, and to simplify maintenance procedures, CMS has set...

Bibliographic Details
Main authors: Sfiligoi, Igor; Letts, James; Belforte, Stefano; Mc Crea, Alison Jean; Larson, Krista Elaine; Zvada, Marian; Holzman, Burt; P Mhashilkar; Bradley, Daniel Charles; Saiz Santos, Maria Dolores; Fanzago, Federica; Gutsche, Oliver; Martin, Terrence; Wuerthwein, Frank Karl
Language: eng
Published: 2013
Subjects: Detectors and Experimental Techniques
Online access: http://cds.cern.ch/record/1622280
author Sfiligoi, Igor
Letts, James
Belforte, Stefano
Mc Crea, Alison Jean
Larson, Krista Elaine
Zvada, Marian
Holzman, Burt
P Mhashilkar
Bradley, Daniel Charles
Saiz Santos, Maria Dolores
Fanzago, Federica
Gutsche, Oliver
Martin, Terrence
Wuerthwein, Frank Karl
author_sort Sfiligoi, Igor
collection CERN
description The CMS experiment at the Large Hadron Collider relies on the HTCondor-based glideinWMS batch system to handle most of its distributed computing needs. To minimize the risk of disruptions due to software and hardware problems, and to simplify maintenance procedures, CMS has set up its glideinWMS instance to use most of the attainable High Availability (HA) features. The setup runs services distributed over multiple nodes, which are in turn located at several physical sites, including Geneva, Switzerland; Chicago, Illinois; and San Diego, California. This paper describes the setup used by CMS, the HA limits of this setup, and the actual operational experience spanning many months.
id cern-1622280
institution European Organization for Nuclear Research (CERN)
language eng
publishDate 2013
record_format invenio
spelling cern-1622280; 2019-09-30T06:29:59Z; http://cds.cern.ch/record/1622280; eng; CMS-CR-2013-369; oai:cds.cern.ch:1622280; 2013-10-29
title CMS experience of running glideinWMS in High Availability mode
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1622280