Cargando…

Resource Optimization for Log Shipper and Preprocessing Pipeline in a Large-Scale Logging System

In resource management, resource optimization is a usual technique to proceed for most professional organizations in order to reduce expenses and to dispose unnecessary resource usages. The European Organization for Nuclear Research (CERN) intends to implement a logging system based on AI for A Larg...

Descripción completa

Detalles Bibliográficos
Autores principales: Lertwuthikarn, Thanarit, Barroso, Vasco Chibante, Akkarajitsakul, Khajonpong
Lenguaje:eng
Publicado: 2022
Materias:
Acceso en línea:https://dx.doi.org/10.1109/ICKII55100.2022.9983590
http://cds.cern.ch/record/2846166
_version_ 1780976619480940544
author Lertwuthikarn, Thanarit
Barroso, Vasco Chibante
Akkarajitsakul, Khajonpong
author_facet Lertwuthikarn, Thanarit
Barroso, Vasco Chibante
Akkarajitsakul, Khajonpong
author_sort Lertwuthikarn, Thanarit
collection CERN
description In resource management, resource optimization is a usual technique to proceed for most professional organizations in order to reduce expenses and to dispose unnecessary resource usages. The European Organization for Nuclear Research (CERN) intends to implement a logging system based on AI for A Large Ion Collider Experiment detector, or ALICE. This system has been being implemented by using the Elasticsearch, Kibana, Beats, and Logstash also called ELK Stack which gives us the capability for the logs aggregation from systems and applications. Log data are collected from involved servers at CERN called First Level Processors (FLPs) nodes by Beats. These nodes run a large number of services when tasks are executed and generate a large volume of log data. Filebeat is used as a log shipper to transfer the data to Logstash, a server-side preprocessing pipeline. When Filebeat and Logstash are working together, there are many configurable factors affecting their efficiency. We then apply a factorial experiment to identify the significant factors and their correlation. These parameters are also optimized to find the best possible values of their configurations. Then, the resource usage can be minimized while a suitable performance of the system is maintained. The results of this study show that we can increase the efficiency of the system thanks to the adjusted values of the parameters. This can be used as a guideline for tuning some configurable parameters to optimize resource usage when there is a large amount of log data to be handled.
id cern-2846166
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2022
record_format invenio
spelling cern-28461662023-01-27T15:45:25Zdoi:10.1109/ICKII55100.2022.9983590http://cds.cern.ch/record/2846166engLertwuthikarn, ThanaritBarroso, Vasco ChibanteAkkarajitsakul, KhajonpongResource Optimization for Log Shipper and Preprocessing Pipeline in a Large-Scale Logging SystemComputing and ComputersInformation Transfer and ManagementIn resource management, resource optimization is a usual technique to proceed for most professional organizations in order to reduce expenses and to dispose unnecessary resource usages. The European Organization for Nuclear Research (CERN) intends to implement a logging system based on AI for A Large Ion Collider Experiment detector, or ALICE. This system has been being implemented by using the Elasticsearch, Kibana, Beats, and Logstash also called ELK Stack which gives us the capability for the logs aggregation from systems and applications. Log data are collected from involved servers at CERN called First Level Processors (FLPs) nodes by Beats. These nodes run a large number of services when tasks are executed and generate a large volume of log data. Filebeat is used as a log shipper to transfer the data to Logstash, a server-side preprocessing pipeline. When Filebeat and Logstash are working together, there are many configurable factors affecting their efficiency. We then apply a factorial experiment to identify the significant factors and their correlation. These parameters are also optimized to find the best possible values of their configurations. Then, the resource usage can be minimized while a suitable performance of the system is maintained. The results of this study show that we can increase the efficiency of the system thanks to the adjusted values of the parameters. This can be used as a guideline for tuning some configurable parameters to optimize resource usage when there is a large amount of log data to be handled.oai:cds.cern.ch:28461662022
spellingShingle Computing and Computers
Information Transfer and Management
Lertwuthikarn, Thanarit
Barroso, Vasco Chibante
Akkarajitsakul, Khajonpong
Resource Optimization for Log Shipper and Preprocessing Pipeline in a Large-Scale Logging System
title Resource Optimization for Log Shipper and Preprocessing Pipeline in a Large-Scale Logging System
title_full Resource Optimization for Log Shipper and Preprocessing Pipeline in a Large-Scale Logging System
title_fullStr Resource Optimization for Log Shipper and Preprocessing Pipeline in a Large-Scale Logging System
title_full_unstemmed Resource Optimization for Log Shipper and Preprocessing Pipeline in a Large-Scale Logging System
title_short Resource Optimization for Log Shipper and Preprocessing Pipeline in a Large-Scale Logging System
title_sort resource optimization for log shipper and preprocessing pipeline in a large-scale logging system
topic Computing and Computers
Information Transfer and Management
url https://dx.doi.org/10.1109/ICKII55100.2022.9983590
http://cds.cern.ch/record/2846166
work_keys_str_mv AT lertwuthikarnthanarit resourceoptimizationforlogshipperandpreprocessingpipelineinalargescaleloggingsystem
AT barrosovascochibante resourceoptimizationforlogshipperandpreprocessingpipelineinalargescaleloggingsystem
AT akkarajitsakulkhajonpong resourceoptimizationforlogshipperandpreprocessingpipelineinalargescaleloggingsystem