Cargando…

The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS

OSG has been operating for a few years at UCSD a glideinWMS factory for several scientific communities, including CMS analysis, HCC and GLOW. This setup worked fine, but it had become a single point of failure. OSG thus recently added another instance at Indiana University, serving the same user com...

Descripción completa

Detalles Bibliográficos
Autores principales: Sfiligoi, Igor, Dost, Jeffrey Michael, Zvada, Marian, Butenas, Ignas, Holzman, Burt, Wuerthwein, Frank Karl, Kreuzer, Peter, Teige, Scott W., Quick, Robert, Hernandez, Jose M., Flix Molina, Jose
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/396/3/032103
http://cds.cern.ch/record/1458477
_version_ 1780925160670363648
author Sfiligoi, Igor
Dost, Jeffrey Michael
Zvada, Marian
Butenas, Ignas
Holzman, Burt
Wuerthwein, Frank Karl
Kreuzer, Peter
Teige, Scott W.
Quick, Robert
Hernandez, Jose M.
Flix Molina, Jose
author_facet Sfiligoi, Igor
Dost, Jeffrey Michael
Zvada, Marian
Butenas, Ignas
Holzman, Burt
Wuerthwein, Frank Karl
Kreuzer, Peter
Teige, Scott W.
Quick, Robert
Hernandez, Jose M.
Flix Molina, Jose
author_sort Sfiligoi, Igor
collection CERN
description OSG has been operating for a few years at UCSD a glideinWMS factory for several scientific communities, including CMS analysis, HCC and GLOW. This setup worked fine, but it had become a single point of failure. OSG thus recently added another instance at Indiana University, serving the same user communities. Similarly, CMS has been operating a glidein factory dedicated to reprocessing activities at Fermilab, with similar results. Recently, CMS decided to host another glidein factory at CERN, to increase the availability of the system, both for analysis, MC and reprocessing jobs. Given the large overlap between this new factory and the three factories in the US, and given that CMS represents a significant fraction of glideins going through the OSG factories, CMS and OSG formed a common operations team that operates all of the above factories. The reasoning behind this arrangement is that most operational issues stem from Grid-related problems, and are very similar for all the factory instances. Solving a problem in one instance thus very often solves the problem for all of them. This paper presents the operational experience of how we address both the social and technical issues of running multiple instances of a glideinWMS factory with operations staff spanning multiple time zones on two continents.
id cern-1458477
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling cern-14584772019-09-30T06:29:59Zdoi:10.1088/1742-6596/396/3/032103http://cds.cern.ch/record/1458477engSfiligoi, IgorDost, Jeffrey MichaelZvada, MarianButenas, IgnasHolzman, BurtWuerthwein, Frank KarlKreuzer, PeterTeige, Scott W.Quick, RobertHernandez, Jose M.Flix Molina, JoseThe benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMSDetectors and Experimental TechniquesOSG has been operating for a few years at UCSD a glideinWMS factory for several scientific communities, including CMS analysis, HCC and GLOW. This setup worked fine, but it had become a single point of failure. OSG thus recently added another instance at Indiana University, serving the same user communities. Similarly, CMS has been operating a glidein factory dedicated to reprocessing activities at Fermilab, with similar results. Recently, CMS decided to host another glidein factory at CERN, to increase the availability of the system, both for analysis, MC and reprocessing jobs. Given the large overlap between this new factory and the three factories in the US, and given that CMS represents a significant fraction of glideins going through the OSG factories, CMS and OSG formed a common operations team that operates all of the above factories. The reasoning behind this arrangement is that most operational issues stem from Grid-related problems, and are very similar for all the factory instances. Solving a problem in one instance thus very often solves the problem for all of them. This paper presents the operational experience of how we address both the social and technical issues of running multiple instances of a glideinWMS factory with operations staff spanning multiple time zones on two continents.CMS-CR-2012-068oai:cds.cern.ch:14584772012-05-10
spellingShingle Detectors and Experimental Techniques
Sfiligoi, Igor
Dost, Jeffrey Michael
Zvada, Marian
Butenas, Ignas
Holzman, Burt
Wuerthwein, Frank Karl
Kreuzer, Peter
Teige, Scott W.
Quick, Robert
Hernandez, Jose M.
Flix Molina, Jose
The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS
title The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS
title_full The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS
title_fullStr The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS
title_full_unstemmed The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS
title_short The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS
title_sort benefits and challenges of sharing glidein factory operations across nine time zones between osg and cms
topic Detectors and Experimental Techniques
url https://dx.doi.org/10.1088/1742-6596/396/3/032103
http://cds.cern.ch/record/1458477
work_keys_str_mv AT sfiligoiigor thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT dostjeffreymichael thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT zvadamarian thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT butenasignas thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT holzmanburt thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT wuerthweinfrankkarl thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT kreuzerpeter thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT teigescottw thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT quickrobert thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT hernandezjosem thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT flixmolinajose thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT sfiligoiigor benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT dostjeffreymichael benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT zvadamarian benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT butenasignas benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT holzmanburt benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT wuerthweinfrankkarl benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT kreuzerpeter benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT teigescottw benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT quickrobert benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT hernandezjosem benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms
AT flixmolinajose benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms