Cargando…

Anomaly Detection for the Centralised Elasticsearch Service at CERN

For several years CERN has been offering a centralised service for Elasticsearch, a popular distributed system for search and analytics of user provided data. The service offered by CERN IT is better described as a service of services, delivering centrally managed and maintained Elasticsearch instan...

Descripción completa

Detalles Bibliográficos
Autores principales: Andersson, Jennifer R., Moya, Jose Alonso, Schwickerath, Ulrich
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637786/
https://www.ncbi.nlm.nih.gov/pubmed/34870189
http://dx.doi.org/10.3389/fdata.2021.718879
_version_ 1784608816601497600
author Andersson, Jennifer R.
Moya, Jose Alonso
Schwickerath, Ulrich
author_facet Andersson, Jennifer R.
Moya, Jose Alonso
Schwickerath, Ulrich
author_sort Andersson, Jennifer R.
collection PubMed
description For several years CERN has been offering a centralised service for Elasticsearch, a popular distributed system for search and analytics of user provided data. The service offered by CERN IT is better described as a service of services, delivering centrally managed and maintained Elasticsearch instances to CERN users who have a justified need for it. This dynamic infrastructure currently consists of about 30 distinct and independent Elasticsearch installations, in the following referred to as Elasticsearch clusters, some of which are shared between different user communities. The service is used by several hundred users mainly for logs and service analytics. Due to its size and complexity, the installation produces a huge amount of internal monitoring data which can be difficult to process in real time with limited available person power. Early on, an idea was therefore born to process this data automatically, aiming to extract anomalies and possible issues building up in real time, allowing the experts to address them before they start to cause an issue for the users of the service. Both deep learning and traditional methods have been applied to analyse the data in order to achieve this goal. This resulted in the current deployment of an anomaly detection system based on a one layer multi dimensional LSTM neural network, coupled with applying a simple moving average to the data to validate the results. This paper will describe which methods were investigated and give an overview of the current system, including data retrieval, data pre-processing and analysis. In addition, reports on experiences gained when applying the system to actual data will be provided. Finally, weaknesses of the current system will be briefly discussed, and ideas for future system improvements will be sketched out.
format Online
Article
Text
id pubmed-8637786
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86377862021-12-03 Anomaly Detection for the Centralised Elasticsearch Service at CERN Andersson, Jennifer R. Moya, Jose Alonso Schwickerath, Ulrich Front Big Data Big Data For several years CERN has been offering a centralised service for Elasticsearch, a popular distributed system for search and analytics of user provided data. The service offered by CERN IT is better described as a service of services, delivering centrally managed and maintained Elasticsearch instances to CERN users who have a justified need for it. This dynamic infrastructure currently consists of about 30 distinct and independent Elasticsearch installations, in the following referred to as Elasticsearch clusters, some of which are shared between different user communities. The service is used by several hundred users mainly for logs and service analytics. Due to its size and complexity, the installation produces a huge amount of internal monitoring data which can be difficult to process in real time with limited available person power. Early on, an idea was therefore born to process this data automatically, aiming to extract anomalies and possible issues building up in real time, allowing the experts to address them before they start to cause an issue for the users of the service. Both deep learning and traditional methods have been applied to analyse the data in order to achieve this goal. This resulted in the current deployment of an anomaly detection system based on a one layer multi dimensional LSTM neural network, coupled with applying a simple moving average to the data to validate the results. This paper will describe which methods were investigated and give an overview of the current system, including data retrieval, data pre-processing and analysis. In addition, reports on experiences gained when applying the system to actual data will be provided. Finally, weaknesses of the current system will be briefly discussed, and ideas for future system improvements will be sketched out. Frontiers Media S.A. 2021-11-16 /pmc/articles/PMC8637786/ /pubmed/34870189 http://dx.doi.org/10.3389/fdata.2021.718879 Text en Copyright © 2021 Andersson, Moya and Schwickerath. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Andersson, Jennifer R.
Moya, Jose Alonso
Schwickerath, Ulrich
Anomaly Detection for the Centralised Elasticsearch Service at CERN
title Anomaly Detection for the Centralised Elasticsearch Service at CERN
title_full Anomaly Detection for the Centralised Elasticsearch Service at CERN
title_fullStr Anomaly Detection for the Centralised Elasticsearch Service at CERN
title_full_unstemmed Anomaly Detection for the Centralised Elasticsearch Service at CERN
title_short Anomaly Detection for the Centralised Elasticsearch Service at CERN
title_sort anomaly detection for the centralised elasticsearch service at cern
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637786/
https://www.ncbi.nlm.nih.gov/pubmed/34870189
http://dx.doi.org/10.3389/fdata.2021.718879
work_keys_str_mv AT anderssonjenniferr anomalydetectionforthecentralisedelasticsearchserviceatcern
AT moyajosealonso anomalydetectionforthecentralisedelasticsearchserviceatcern
AT schwickerathulrich anomalydetectionforthecentralisedelasticsearchserviceatcern