Cargando…

Erratic server behavior detection using machine learning on streams of monitoring data

With the explosion of the number of distributed applications, a new dynamic server environment emerged grouping servers into clusters, utilization of which depends on the current demand for the application. To provide reliable and smooth services it is crucial to detect and fix possible erratic beha...

Descripción completa

Detalles Bibliográficos
Autores principales: Adam, Martin, Magnoni, Luca, Pilát, Martin, Adamová, Dagmar
Lenguaje:eng
Publicado: 2020
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/202024507002
http://cds.cern.ch/record/2752527
_version_ 1780969332958822400
author Adam, Martin
Magnoni, Luca
Pilát, Martin
Adamová, Dagmar
author_facet Adam, Martin
Magnoni, Luca
Pilát, Martin
Adamová, Dagmar
author_sort Adam, Martin
collection CERN
description With the explosion of the number of distributed applications, a new dynamic server environment emerged grouping servers into clusters, utilization of which depends on the current demand for the application. To provide reliable and smooth services it is crucial to detect and fix possible erratic behavior of individual servers in these clusters. Use of standard techniques for this purpose requires manual work and delivers sub-optimal results. Using only application agnostic monitoring metrics our machine learning based method analyzes the recent performance of the inspected server as well as the state of the rest of the cluster, thus checking not only the behavior of the single server, but the load on the whole distributed application as well. We have implemented our method in a Spark job running in the CERN MONIT infrastructure. In this contribution we present results of testing multiple machine learning algorithms and pre-processing techniques to identify the servers erratic behavior. We also discuss the challenges of deploying our new method into production.
id oai-inspirehep.net-1832192
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
record_format invenio
spelling oai-inspirehep.net-18321922021-02-22T19:39:28Zdoi:10.1051/epjconf/202024507002http://cds.cern.ch/record/2752527engAdam, MartinMagnoni, LucaPilát, MartinAdamová, DagmarErratic server behavior detection using machine learning on streams of monitoring dataComputing and ComputersWith the explosion of the number of distributed applications, a new dynamic server environment emerged grouping servers into clusters, utilization of which depends on the current demand for the application. To provide reliable and smooth services it is crucial to detect and fix possible erratic behavior of individual servers in these clusters. Use of standard techniques for this purpose requires manual work and delivers sub-optimal results. Using only application agnostic monitoring metrics our machine learning based method analyzes the recent performance of the inspected server as well as the state of the rest of the cluster, thus checking not only the behavior of the single server, but the load on the whole distributed application as well. We have implemented our method in a Spark job running in the CERN MONIT infrastructure. In this contribution we present results of testing multiple machine learning algorithms and pre-processing techniques to identify the servers erratic behavior. We also discuss the challenges of deploying our new method into production.oai:inspirehep.net:18321922020
spellingShingle Computing and Computers
Adam, Martin
Magnoni, Luca
Pilát, Martin
Adamová, Dagmar
Erratic server behavior detection using machine learning on streams of monitoring data
title Erratic server behavior detection using machine learning on streams of monitoring data
title_full Erratic server behavior detection using machine learning on streams of monitoring data
title_fullStr Erratic server behavior detection using machine learning on streams of monitoring data
title_full_unstemmed Erratic server behavior detection using machine learning on streams of monitoring data
title_short Erratic server behavior detection using machine learning on streams of monitoring data
title_sort erratic server behavior detection using machine learning on streams of monitoring data
topic Computing and Computers
url https://dx.doi.org/10.1051/epjconf/202024507002
http://cds.cern.ch/record/2752527
work_keys_str_mv AT adammartin erraticserverbehaviordetectionusingmachinelearningonstreamsofmonitoringdata
AT magnoniluca erraticserverbehaviordetectionusingmachinelearningonstreamsofmonitoringdata
AT pilatmartin erraticserverbehaviordetectionusingmachinelearningonstreamsofmonitoringdata
AT adamovadagmar erraticserverbehaviordetectionusingmachinelearningonstreamsofmonitoringdata