Cargando…

Alert Messaging in the CMS Distributed Workflow System

WMAgent is the core component of the CMS workload management system. One of the features of this job managing platform is a configurable messaging system aimed at generating, distributing and processing alerts: short messages describing a given alert-worthy informational or pathological condition. A...

Descripción completa

Detalles Bibliográficos
Autor principal: Maxa, Zdenek
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/396/3/032074
http://cds.cern.ch/record/1457815
_version_ 1780925137920458752
author Maxa, Zdenek
author_facet Maxa, Zdenek
author_sort Maxa, Zdenek
collection CERN
description WMAgent is the core component of the CMS workload management system. One of the features of this job managing platform is a configurable messaging system aimed at generating, distributing and processing alerts: short messages describing a given alert-worthy informational or pathological condition. Apart from the framework's sub-components running within the WMAgent instances, there is a stand-alone application collecting alerts from all WMAgent instances running across the CMS distributed computing environment. The alert framework has a versatile design that allows for receiving alert messages also from other CMS production applications, such as PhEDEx data transfer manager. We present implementation details of the system, including its python implementation using ZeroMQ, CouchDB message storage and future visions as well as operational experiences. Inter-operation with monitoring platforms such as Dashboard or Lemon is described.
id cern-1457815
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling cern-14578152019-09-30T06:29:59Zdoi:10.1088/1742-6596/396/3/032074http://cds.cern.ch/record/1457815engMaxa, ZdenekAlert Messaging in the CMS Distributed Workflow SystemDetectors and Experimental TechniquesWMAgent is the core component of the CMS workload management system. One of the features of this job managing platform is a configurable messaging system aimed at generating, distributing and processing alerts: short messages describing a given alert-worthy informational or pathological condition. Apart from the framework's sub-components running within the WMAgent instances, there is a stand-alone application collecting alerts from all WMAgent instances running across the CMS distributed computing environment. The alert framework has a versatile design that allows for receiving alert messages also from other CMS production applications, such as PhEDEx data transfer manager. We present implementation details of the system, including its python implementation using ZeroMQ, CouchDB message storage and future visions as well as operational experiences. Inter-operation with monitoring platforms such as Dashboard or Lemon is described.CMS-CR-2012-119oai:cds.cern.ch:14578152012-05-29
spellingShingle Detectors and Experimental Techniques
Maxa, Zdenek
Alert Messaging in the CMS Distributed Workflow System
title Alert Messaging in the CMS Distributed Workflow System
title_full Alert Messaging in the CMS Distributed Workflow System
title_fullStr Alert Messaging in the CMS Distributed Workflow System
title_full_unstemmed Alert Messaging in the CMS Distributed Workflow System
title_short Alert Messaging in the CMS Distributed Workflow System
title_sort alert messaging in the cms distributed workflow system
topic Detectors and Experimental Techniques
url https://dx.doi.org/10.1088/1742-6596/396/3/032074
http://cds.cern.ch/record/1457815
work_keys_str_mv AT maxazdenek alertmessaginginthecmsdistributedworkflowsystem