Cargando…

Job Life Cycle Management Libraries for CMS Workflow Management Projects

Scientific analysis and simulation requires the processing and generation of millions of data samples. These processing and generation tasks are often comprised of multiple smaller tasks divided over multiple (computing) sites. This paper discusses the Compact Muon Solenoid (CMS) workflow infrastruc...

Descripción completa

Detalles Bibliográficos
Autores principales: Van Lingen, Frank, Evans, Dave, Metson, Simon, Wakefield, Stuart, Wilkinson, Rick, Jackson, James, Spiga, Daniele, Foulkes, Stephen, Afaq, Anzar, Kuznetsov, Valentin, Vaandering, Eric, Ryu, Seangchan, Farina, Fabio, Codispoti, Giuseppe, Cinquilli, Mattia
Lenguaje:eng
Publicado: 2009
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/219/4/042024
http://cds.cern.ch/record/1196166
Descripción
Sumario:Scientific analysis and simulation requires the processing and generation of millions of data samples. These processing and generation tasks are often comprised of multiple smaller tasks divided over multiple (computing) sites. This paper discusses the Compact Muon Solenoid (CMS) workflow infrastructure, and specifically the Python based workflow library which is used for so called task lifecycle management. The CMS workflow infrastructure consists of three layers: high level specification of the various tasks based on input/output datasets, life cycle management of task instances derived from the high level specification and execution management. The workflow library is the result of a convergence of three CMS subprojects that respectively deal with scientific analysis, simulation and real time data aggregation from the experiment.