Cargando…

Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System

Network monitoring is of great importance for every data acquisition system (DAQ), it ensures stable and uninterrupted data flow. However, when using standard tools such as Icinga, often homogeneity of the DAQ hardware is not exploited. We will present the application of machine learning techniques...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wyszynski, Oskar Justynian, Pozo Astigarraga, Mikel Eukeni
Lenguaje:	eng
Publicado:	2019
Materias:	Particle Physics - Experiment
Acceso en línea:	http://cds.cern.ch/record/2667381

_version_	1780962063689973760
author	Wyszynski, Oskar Justynian Pozo Astigarraga, Mikel Eukeni
author_facet	Wyszynski, Oskar Justynian Pozo Astigarraga, Mikel Eukeni
author_sort	Wyszynski, Oskar Justynian
collection	CERN
description	Network monitoring is of great importance for every data acquisition system (DAQ), it ensures stable and uninterrupted data flow. However, when using standard tools such as Icinga, often homogeneity of the DAQ hardware is not exploited. We will present the application of machine learning techniques to detect anomalies among network devices as well as connection instabilities. The former exploits homogeneity of network hardware to detect device anomalies such as too high CPU or memory utilization, and consequently uncover a pre-failure state. The latter algorithm learns to distinguish between port speed instabilities caused by, e.g. failing transceiver or fiber, and speed changes due to scheduled system reboots. All the algorithms described are implemented in the DAQ network of the ATLAS experiment.
id	cern-2667381
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2019
record_format	invenio
spelling	cern-26673812019-09-30T06:29:59Zhttp://cds.cern.ch/record/2667381engWyszynski, Oskar JustynianPozo Astigarraga, Mikel EukeniMachine Learning Techniques in the ATLAS TDAQ Network Monitoring SystemParticle Physics - ExperimentNetwork monitoring is of great importance for every data acquisition system (DAQ), it ensures stable and uninterrupted data flow. However, when using standard tools such as Icinga, often homogeneity of the DAQ hardware is not exploited. We will present the application of machine learning techniques to detect anomalies among network devices as well as connection instabilities. The former exploits homogeneity of network hardware to detect device anomalies such as too high CPU or memory utilization, and consequently uncover a pre-failure state. The latter algorithm learns to distinguish between port speed instabilities caused by, e.g. failing transceiver or fiber, and speed changes due to scheduled system reboots. All the algorithms described are implemented in the DAQ network of the ATLAS experiment.ATL-DAQ-SLIDE-2019-082oai:cds.cern.ch:26673812019-03-18
spellingShingle	Particle Physics - Experiment Wyszynski, Oskar Justynian Pozo Astigarraga, Mikel Eukeni Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System
title	Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System
title_full	Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System
title_fullStr	Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System
title_full_unstemmed	Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System
title_short	Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System
title_sort	machine learning techniques in the atlas tdaq network monitoring system
topic	Particle Physics - Experiment
url	http://cds.cern.ch/record/2667381
work_keys_str_mv	AT wyszynskioskarjustynian machinelearningtechniquesintheatlastdaqnetworkmonitoringsystem AT pozoastigarragamikeleukeni machinelearningtechniquesintheatlastdaqnetworkmonitoringsystem

Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System

Ejemplares similares