Cargando…
Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System
Network monitoring is of great importance for every data acquisition system (DAQ), it ensures stable and uninterrupted data flow. However, when using standard tools such as Icinga, often homogeneity of the DAQ hardware is not exploited. We will present the application of machine learning techniques...
Autores principales: | , |
---|---|
Lenguaje: | eng |
Publicado: |
2019
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2667381 |
_version_ | 1780962063689973760 |
---|---|
author | Wyszynski, Oskar Justynian Pozo Astigarraga, Mikel Eukeni |
author_facet | Wyszynski, Oskar Justynian Pozo Astigarraga, Mikel Eukeni |
author_sort | Wyszynski, Oskar Justynian |
collection | CERN |
description | Network monitoring is of great importance for every data acquisition system (DAQ), it ensures stable and uninterrupted data flow. However, when using standard tools such as Icinga, often homogeneity of the DAQ hardware is not exploited. We will present the application of machine learning techniques to detect anomalies among network devices as well as connection instabilities. The former exploits homogeneity of network hardware to detect device anomalies such as too high CPU or memory utilization, and consequently uncover a pre-failure state. The latter algorithm learns to distinguish between port speed instabilities caused by, e.g. failing transceiver or fiber, and speed changes due to scheduled system reboots. All the algorithms described are implemented in the DAQ network of the ATLAS experiment. |
id | cern-2667381 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2019 |
record_format | invenio |
spelling | cern-26673812019-09-30T06:29:59Zhttp://cds.cern.ch/record/2667381engWyszynski, Oskar JustynianPozo Astigarraga, Mikel EukeniMachine Learning Techniques in the ATLAS TDAQ Network Monitoring SystemParticle Physics - ExperimentNetwork monitoring is of great importance for every data acquisition system (DAQ), it ensures stable and uninterrupted data flow. However, when using standard tools such as Icinga, often homogeneity of the DAQ hardware is not exploited. We will present the application of machine learning techniques to detect anomalies among network devices as well as connection instabilities. The former exploits homogeneity of network hardware to detect device anomalies such as too high CPU or memory utilization, and consequently uncover a pre-failure state. The latter algorithm learns to distinguish between port speed instabilities caused by, e.g. failing transceiver or fiber, and speed changes due to scheduled system reboots. All the algorithms described are implemented in the DAQ network of the ATLAS experiment.ATL-DAQ-SLIDE-2019-082oai:cds.cern.ch:26673812019-03-18 |
spellingShingle | Particle Physics - Experiment Wyszynski, Oskar Justynian Pozo Astigarraga, Mikel Eukeni Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System |
title | Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System |
title_full | Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System |
title_fullStr | Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System |
title_full_unstemmed | Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System |
title_short | Machine Learning Techniques in the ATLAS TDAQ Network Monitoring System |
title_sort | machine learning techniques in the atlas tdaq network monitoring system |
topic | Particle Physics - Experiment |
url | http://cds.cern.ch/record/2667381 |
work_keys_str_mv | AT wyszynskioskarjustynian machinelearningtechniquesintheatlastdaqnetworkmonitoringsystem AT pozoastigarragamikeleukeni machinelearningtechniquesintheatlastdaqnetworkmonitoringsystem |