Cargando…
A gLite FTS based solution for managing user output in CMS
The CMS distributed data analysis workflow assumes that jobs run in a different location to where their results are finally stored. Typically the user output must be transferred across the network from one site to another, possibly on a different continent or over links not necessarily validated for...
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2012
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/1458471 |
_version_ | 1780925159828357120 |
---|---|
author | Spiga, Daniele |
author_facet | Spiga, Daniele |
author_sort | Spiga, Daniele |
collection | CERN |
description | The CMS distributed data analysis workflow assumes that jobs run in a
different location to where their results are finally stored. Typically
the user output must be transferred across the network from one site to
another, possibly on a different continent or over links not necessarily
validated for high bandwidth/high reliability transfer. This step is named
stage-out and in CMS was originally implemented as a synchronous step of
the job execution. However, our experience showed the weakness of this
approach both in terms of low total job execution efficiency and failure
rates, wasting precious CPU resources.
The nature of analysis data makes it inappropriate to use PhEDEx, CMS'
core data placement system. As part of the new generation of CMS Workload
Management tools, the Asynchronous Stage-Out system (AsyncStageOut) has
been developed to enable third party copy of the user output. The
AsyncStageOut component manages glite FTS transfers of data from a
temporary store at the site where the job ran to the final location of the
data on behalf of that data owner.
The tool uses python daemons, built using the WMCore framework, talking to
CouchDB, to manage the queue of work and FTS transfers. CouchDB also
provides the platform for a dedicated operations monitoring system.
In this paper, we present the motivations of the asynchronous stage-out
system. We give an insight into the design and the implementation of key
features, describing how it is coupled with the CMS workload management
system. Finally, we show the results and the commissioning experience. |
id | cern-1458471 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2012 |
record_format | invenio |
spelling | cern-14584712019-09-30T06:29:59Zhttp://cds.cern.ch/record/1458471engSpiga, DanieleA gLite FTS based solution for managing user output in CMSDetectors and Experimental TechniquesThe CMS distributed data analysis workflow assumes that jobs run in a different location to where their results are finally stored. Typically the user output must be transferred across the network from one site to another, possibly on a different continent or over links not necessarily validated for high bandwidth/high reliability transfer. This step is named stage-out and in CMS was originally implemented as a synchronous step of the job execution. However, our experience showed the weakness of this approach both in terms of low total job execution efficiency and failure rates, wasting precious CPU resources. The nature of analysis data makes it inappropriate to use PhEDEx, CMS' core data placement system. As part of the new generation of CMS Workload Management tools, the Asynchronous Stage-Out system (AsyncStageOut) has been developed to enable third party copy of the user output. The AsyncStageOut component manages glite FTS transfers of data from a temporary store at the site where the job ran to the final location of the data on behalf of that data owner. The tool uses python daemons, built using the WMCore framework, talking to CouchDB, to manage the queue of work and FTS transfers. CouchDB also provides the platform for a dedicated operations monitoring system. In this paper, we present the motivations of the asynchronous stage-out system. We give an insight into the design and the implementation of key features, describing how it is coupled with the CMS workload management system. Finally, we show the results and the commissioning experience.CMS-CR-2012-141oai:cds.cern.ch:14584712012-06-13 |
spellingShingle | Detectors and Experimental Techniques Spiga, Daniele A gLite FTS based solution for managing user output in CMS |
title | A gLite FTS based solution for managing user output in CMS |
title_full | A gLite FTS based solution for managing user output in CMS |
title_fullStr | A gLite FTS based solution for managing user output in CMS |
title_full_unstemmed | A gLite FTS based solution for managing user output in CMS |
title_short | A gLite FTS based solution for managing user output in CMS |
title_sort | glite fts based solution for managing user output in cms |
topic | Detectors and Experimental Techniques |
url | http://cds.cern.ch/record/1458471 |
work_keys_str_mv | AT spigadaniele agliteftsbasedsolutionformanaginguseroutputincms AT spigadaniele gliteftsbasedsolutionformanaginguseroutputincms |