Cargando…

Application of rule-based data mining techniques to real time ATLAS Grid job monitoring data

The Job Execution Monitor (JEM) is a job-centric grid job monitoring software developed at the University of Wuppertal and integrated into the pilot-based “PanDA” job brokerage system leveraging physics analysis and Monte Carlo event production for the ATLAS experiment on the Worldwide LHC Computing...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahrens, R, Harenberg, T, Kalinin, S, Maettig, P, Sandhoff, M, dos Santos, T, Volkmer, F
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/396/3/032060
http://cds.cern.ch/record/1450171
_version_ 1780924909006880768
author Ahrens, R
Harenberg, T
Kalinin, S
Maettig, P
Sandhoff, M
dos Santos, T
Volkmer, F
author_facet Ahrens, R
Harenberg, T
Kalinin, S
Maettig, P
Sandhoff, M
dos Santos, T
Volkmer, F
author_sort Ahrens, R
collection CERN
description The Job Execution Monitor (JEM) is a job-centric grid job monitoring software developed at the University of Wuppertal and integrated into the pilot-based “PanDA” job brokerage system leveraging physics analysis and Monte Carlo event production for the ATLAS experiment on the Worldwide LHC Computing Grid (WLCG). With JEM, job progress and grid worker node health can be supervised in real time by users, site admins and shift personnel. Imminent error conditions can be detected early and countermeasures can be initiated by the Job’s owner immideatly. Grid site admins can access aggregated data of all monitored jobs to infer the site status and to detect job and Grid worker node misbehaviour. Shifters can use the same aggregated data to quickly react to site error conditions and broken production tasks. In this work, the application of novel data-centric rule based methods and data-mining techniques to the real time monitoring data is discussed. The usage of such automatic inference techniques on monitoring data to provide job and site health summary information to users and admins is presented. Finally, the provision of a secure real-time control and steering channel to the job as extension of the presented monitoring software is considered and a possible model of such the control method is presented.
id cern-1450171
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling cern-14501712019-09-30T06:29:59Zdoi:10.1088/1742-6596/396/3/032060http://cds.cern.ch/record/1450171engAhrens, RHarenberg, TKalinin, SMaettig, PSandhoff, Mdos Santos, TVolkmer, FApplication of rule-based data mining techniques to real time ATLAS Grid job monitoring dataDetectors and Experimental TechniquesThe Job Execution Monitor (JEM) is a job-centric grid job monitoring software developed at the University of Wuppertal and integrated into the pilot-based “PanDA” job brokerage system leveraging physics analysis and Monte Carlo event production for the ATLAS experiment on the Worldwide LHC Computing Grid (WLCG). With JEM, job progress and grid worker node health can be supervised in real time by users, site admins and shift personnel. Imminent error conditions can be detected early and countermeasures can be initiated by the Job’s owner immideatly. Grid site admins can access aggregated data of all monitored jobs to infer the site status and to detect job and Grid worker node misbehaviour. Shifters can use the same aggregated data to quickly react to site error conditions and broken production tasks. In this work, the application of novel data-centric rule based methods and data-mining techniques to the real time monitoring data is discussed. The usage of such automatic inference techniques on monitoring data to provide job and site health summary information to users and admins is presented. Finally, the provision of a secure real-time control and steering channel to the job as extension of the presented monitoring software is considered and a possible model of such the control method is presented.ATL-SOFT-PROC-2012-046oai:cds.cern.ch:14501712012-05-22
spellingShingle Detectors and Experimental Techniques
Ahrens, R
Harenberg, T
Kalinin, S
Maettig, P
Sandhoff, M
dos Santos, T
Volkmer, F
Application of rule-based data mining techniques to real time ATLAS Grid job monitoring data
title Application of rule-based data mining techniques to real time ATLAS Grid job monitoring data
title_full Application of rule-based data mining techniques to real time ATLAS Grid job monitoring data
title_fullStr Application of rule-based data mining techniques to real time ATLAS Grid job monitoring data
title_full_unstemmed Application of rule-based data mining techniques to real time ATLAS Grid job monitoring data
title_short Application of rule-based data mining techniques to real time ATLAS Grid job monitoring data
title_sort application of rule-based data mining techniques to real time atlas grid job monitoring data
topic Detectors and Experimental Techniques
url https://dx.doi.org/10.1088/1742-6596/396/3/032060
http://cds.cern.ch/record/1450171
work_keys_str_mv AT ahrensr applicationofrulebaseddataminingtechniquestorealtimeatlasgridjobmonitoringdata
AT harenbergt applicationofrulebaseddataminingtechniquestorealtimeatlasgridjobmonitoringdata
AT kalinins applicationofrulebaseddataminingtechniquestorealtimeatlasgridjobmonitoringdata
AT maettigp applicationofrulebaseddataminingtechniquestorealtimeatlasgridjobmonitoringdata
AT sandhoffm applicationofrulebaseddataminingtechniquestorealtimeatlasgridjobmonitoringdata
AT dossantost applicationofrulebaseddataminingtechniquestorealtimeatlasgridjobmonitoringdata
AT volkmerf applicationofrulebaseddataminingtechniquestorealtimeatlasgridjobmonitoringdata