Cargando…

Collecting and Storing Data Flow Monitoring in Elasticsearch

A very large amount of data is produced from the online data flow monitoring for the CMS data acquisition system. However, there are only a small portion of data is stored permanently in the relational database. This is because of the high cost needed while relying on the dedicated infrastructure as...

Descripción completa

Detalles Bibliográficos
Autor principal: Hashim, Fatin Hazwani
Lenguaje:eng
Publicado: 2014
Materias:
Acceso en línea:http://cds.cern.ch/record/1751427
_version_ 1780943148533415936
author Hashim, Fatin Hazwani
author_facet Hashim, Fatin Hazwani
author_sort Hashim, Fatin Hazwani
collection CERN
description A very large amount of data is produced from the online data flow monitoring for the CMS data acquisition system. However, there are only a small portion of data is stored permanently in the relational database. This is because of the high cost needed while relying on the dedicated infrastructure as well as the issues in its performance itself. A new approach needs to be found in order to confront such a big volume of data known as “Big Data”. The Big Data [1] is the term given to the very large and complex data sets that cannot be handled by the traditional data processing application [2] in terms of capturing, storing, managing, and analyzing. The sheer size of the data [3] in CMS data acquisition system is one of the major challenges, and is the one of the most easily recognized. New technology need to be used as the alternative of the traditional databases initial evaluation to handle this problem as more data need to be stored permanently and can be easily retrieved. This report consists of the introduction of Elasticsearch as well as its application in collecting and storing data flow monitoring.
id cern-1751427
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2014
record_format invenio
spelling cern-17514272019-09-30T06:29:59Zhttp://cds.cern.ch/record/1751427engHashim, Fatin HazwaniCollecting and Storing Data Flow Monitoring in ElasticsearchComputing and ComputersInformation Transfer and ManagementA very large amount of data is produced from the online data flow monitoring for the CMS data acquisition system. However, there are only a small portion of data is stored permanently in the relational database. This is because of the high cost needed while relying on the dedicated infrastructure as well as the issues in its performance itself. A new approach needs to be found in order to confront such a big volume of data known as “Big Data”. The Big Data [1] is the term given to the very large and complex data sets that cannot be handled by the traditional data processing application [2] in terms of capturing, storing, managing, and analyzing. The sheer size of the data [3] in CMS data acquisition system is one of the major challenges, and is the one of the most easily recognized. New technology need to be used as the alternative of the traditional databases initial evaluation to handle this problem as more data need to be stored permanently and can be easily retrieved. This report consists of the introduction of Elasticsearch as well as its application in collecting and storing data flow monitoring.CERN-STUDENTS-Note-2014-098oai:cds.cern.ch:17514272014-08-22
spellingShingle Computing and Computers
Information Transfer and Management
Hashim, Fatin Hazwani
Collecting and Storing Data Flow Monitoring in Elasticsearch
title Collecting and Storing Data Flow Monitoring in Elasticsearch
title_full Collecting and Storing Data Flow Monitoring in Elasticsearch
title_fullStr Collecting and Storing Data Flow Monitoring in Elasticsearch
title_full_unstemmed Collecting and Storing Data Flow Monitoring in Elasticsearch
title_short Collecting and Storing Data Flow Monitoring in Elasticsearch
title_sort collecting and storing data flow monitoring in elasticsearch
topic Computing and Computers
Information Transfer and Management
url http://cds.cern.ch/record/1751427
work_keys_str_mv AT hashimfatinhazwani collectingandstoringdataflowmonitoringinelasticsearch