Cargando…
The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS
OSG has been operating for a few years at UCSD a glideinWMS factory for several scientific communities, including CMS analysis, HCC and GLOW. This setup worked fine, but it had become a single point of failure. OSG thus recently added another instance at Indiana University, serving the same user com...
Autores principales: | , , , , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2012
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/396/3/032103 http://cds.cern.ch/record/1458477 |
_version_ | 1780925160670363648 |
---|---|
author | Sfiligoi, Igor Dost, Jeffrey Michael Zvada, Marian Butenas, Ignas Holzman, Burt Wuerthwein, Frank Karl Kreuzer, Peter Teige, Scott W. Quick, Robert Hernandez, Jose M. Flix Molina, Jose |
author_facet | Sfiligoi, Igor Dost, Jeffrey Michael Zvada, Marian Butenas, Ignas Holzman, Burt Wuerthwein, Frank Karl Kreuzer, Peter Teige, Scott W. Quick, Robert Hernandez, Jose M. Flix Molina, Jose |
author_sort | Sfiligoi, Igor |
collection | CERN |
description | OSG has been operating for a few years at UCSD a glideinWMS factory for several scientific communities, including CMS analysis, HCC and GLOW. This setup worked fine, but it had become a single point of failure. OSG thus recently added another instance at Indiana University, serving the same user communities. Similarly, CMS has been operating a glidein factory dedicated to reprocessing activities at Fermilab, with similar results. Recently, CMS decided to host another glidein factory at CERN, to increase the availability of the system, both for analysis, MC and reprocessing jobs. Given the large overlap between this new factory and the three factories in the US, and given that CMS represents a significant fraction of glideins going through the OSG factories, CMS and OSG formed a common operations team that operates all of the above factories. The reasoning behind this arrangement is that most operational issues stem from Grid-related problems, and are very similar for all the factory instances. Solving a problem in one instance thus very often solves the problem for all of them. This paper presents the operational experience of how we address both the social and technical issues of running multiple instances of a glideinWMS factory with operations staff spanning multiple time zones on two continents. |
id | cern-1458477 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2012 |
record_format | invenio |
spelling | cern-14584772019-09-30T06:29:59Zdoi:10.1088/1742-6596/396/3/032103http://cds.cern.ch/record/1458477engSfiligoi, IgorDost, Jeffrey MichaelZvada, MarianButenas, IgnasHolzman, BurtWuerthwein, Frank KarlKreuzer, PeterTeige, Scott W.Quick, RobertHernandez, Jose M.Flix Molina, JoseThe benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMSDetectors and Experimental TechniquesOSG has been operating for a few years at UCSD a glideinWMS factory for several scientific communities, including CMS analysis, HCC and GLOW. This setup worked fine, but it had become a single point of failure. OSG thus recently added another instance at Indiana University, serving the same user communities. Similarly, CMS has been operating a glidein factory dedicated to reprocessing activities at Fermilab, with similar results. Recently, CMS decided to host another glidein factory at CERN, to increase the availability of the system, both for analysis, MC and reprocessing jobs. Given the large overlap between this new factory and the three factories in the US, and given that CMS represents a significant fraction of glideins going through the OSG factories, CMS and OSG formed a common operations team that operates all of the above factories. The reasoning behind this arrangement is that most operational issues stem from Grid-related problems, and are very similar for all the factory instances. Solving a problem in one instance thus very often solves the problem for all of them. This paper presents the operational experience of how we address both the social and technical issues of running multiple instances of a glideinWMS factory with operations staff spanning multiple time zones on two continents.CMS-CR-2012-068oai:cds.cern.ch:14584772012-05-10 |
spellingShingle | Detectors and Experimental Techniques Sfiligoi, Igor Dost, Jeffrey Michael Zvada, Marian Butenas, Ignas Holzman, Burt Wuerthwein, Frank Karl Kreuzer, Peter Teige, Scott W. Quick, Robert Hernandez, Jose M. Flix Molina, Jose The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS |
title | The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS |
title_full | The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS |
title_fullStr | The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS |
title_full_unstemmed | The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS |
title_short | The benefits and challenges of sharing glidein factory operations across nine time zones between OSG and CMS |
title_sort | benefits and challenges of sharing glidein factory operations across nine time zones between osg and cms |
topic | Detectors and Experimental Techniques |
url | https://dx.doi.org/10.1088/1742-6596/396/3/032103 http://cds.cern.ch/record/1458477 |
work_keys_str_mv | AT sfiligoiigor thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT dostjeffreymichael thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT zvadamarian thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT butenasignas thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT holzmanburt thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT wuerthweinfrankkarl thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT kreuzerpeter thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT teigescottw thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT quickrobert thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT hernandezjosem thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT flixmolinajose thebenefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT sfiligoiigor benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT dostjeffreymichael benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT zvadamarian benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT butenasignas benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT holzmanburt benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT wuerthweinfrankkarl benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT kreuzerpeter benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT teigescottw benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT quickrobert benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT hernandezjosem benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms AT flixmolinajose benefitsandchallengesofsharingglideinfactoryoperationsacrossninetimezonesbetweenosgandcms |