Cargando…

Predictive analytics as an essential mechanism for situational awareness at the ATLAS Production System

The workflow management process should be under the control of the certain service that is able to forecast the processing time dynamically according to the status of the processing environment and workflow itself, and to react immediately on any abnormal behaviour of the execution process. Such sit...

Descripción completa

Detalles Bibliográficos
Autores principales: Titov, Mikhail, Gubin, Maksim, Klimentov, Alexei, Barreiro Megino, Fernando Harald, Borodin, Mikhail, Golubkov, Dmitry
Lenguaje:eng
Publicado: 2017
Materias:
Acceso en línea:http://cds.cern.ch/record/2293312
_version_ 1780956522046554112
author Titov, Mikhail
Gubin, Maksim
Klimentov, Alexei
Barreiro Megino, Fernando Harald
Borodin, Mikhail
Golubkov, Dmitry
author_facet Titov, Mikhail
Gubin, Maksim
Klimentov, Alexei
Barreiro Megino, Fernando Harald
Borodin, Mikhail
Golubkov, Dmitry
author_sort Titov, Mikhail
collection CERN
description The workflow management process should be under the control of the certain service that is able to forecast the processing time dynamically according to the status of the processing environment and workflow itself, and to react immediately on any abnormal behaviour of the execution process. Such situational awareness analytic service would provide the possibility to monitor the execution process, to detect the source of any malfunction, and to optimize the management process. The stated service for the second generation of the ATLAS Production System (ProdSys2, an automated scheduling system) is based on predictive analytics approach to estimate the duration of the data processings (in terms of ProdSys2, it is task and chain of tasks) with later usage in decision making processes. Machine learning ensemble methods are chosen to estimate completion time (i.e., “Time To Complete”, TTC) for every (production) task and chain of tasks, thus “abnormal” task processing times would warn about possible failure state of the system. This is the primary phase of the service and its precision is crucial. The first implementation of such analytic service already includes Task TTC Estimator tool and is designed in a way to provide a comprehensive set of options to adjust the analysis process and possibility to extend its functionality.
id cern-2293312
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling cern-22933122019-09-30T06:29:59Zhttp://cds.cern.ch/record/2293312engTitov, MikhailGubin, MaksimKlimentov, AlexeiBarreiro Megino, Fernando HaraldBorodin, MikhailGolubkov, DmitryPredictive analytics as an essential mechanism for situational awareness at the ATLAS Production SystemParticle Physics - ExperimentThe workflow management process should be under the control of the certain service that is able to forecast the processing time dynamically according to the status of the processing environment and workflow itself, and to react immediately on any abnormal behaviour of the execution process. Such situational awareness analytic service would provide the possibility to monitor the execution process, to detect the source of any malfunction, and to optimize the management process. The stated service for the second generation of the ATLAS Production System (ProdSys2, an automated scheduling system) is based on predictive analytics approach to estimate the duration of the data processings (in terms of ProdSys2, it is task and chain of tasks) with later usage in decision making processes. Machine learning ensemble methods are chosen to estimate completion time (i.e., “Time To Complete”, TTC) for every (production) task and chain of tasks, thus “abnormal” task processing times would warn about possible failure state of the system. This is the primary phase of the service and its precision is crucial. The first implementation of such analytic service already includes Task TTC Estimator tool and is designed in a way to provide a comprehensive set of options to adjust the analysis process and possibility to extend its functionality.ATL-SOFT-PROC-2017-071oai:cds.cern.ch:22933122017-11-20
spellingShingle Particle Physics - Experiment
Titov, Mikhail
Gubin, Maksim
Klimentov, Alexei
Barreiro Megino, Fernando Harald
Borodin, Mikhail
Golubkov, Dmitry
Predictive analytics as an essential mechanism for situational awareness at the ATLAS Production System
title Predictive analytics as an essential mechanism for situational awareness at the ATLAS Production System
title_full Predictive analytics as an essential mechanism for situational awareness at the ATLAS Production System
title_fullStr Predictive analytics as an essential mechanism for situational awareness at the ATLAS Production System
title_full_unstemmed Predictive analytics as an essential mechanism for situational awareness at the ATLAS Production System
title_short Predictive analytics as an essential mechanism for situational awareness at the ATLAS Production System
title_sort predictive analytics as an essential mechanism for situational awareness at the atlas production system
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2293312
work_keys_str_mv AT titovmikhail predictiveanalyticsasanessentialmechanismforsituationalawarenessattheatlasproductionsystem
AT gubinmaksim predictiveanalyticsasanessentialmechanismforsituationalawarenessattheatlasproductionsystem
AT klimentovalexei predictiveanalyticsasanessentialmechanismforsituationalawarenessattheatlasproductionsystem
AT barreiromeginofernandoharald predictiveanalyticsasanessentialmechanismforsituationalawarenessattheatlasproductionsystem
AT borodinmikhail predictiveanalyticsasanessentialmechanismforsituationalawarenessattheatlasproductionsystem
AT golubkovdmitry predictiveanalyticsasanessentialmechanismforsituationalawarenessattheatlasproductionsystem