Cargando…

A New Approach to Querying and Processing Data from CERN Accelerator Logging Service

During the previous 10 years, the CERN Accelerator Logging Service has evolved multiple times i.e. from the expected 1TB of data per year in the beginning to more than 50TB/year. It is used to store and retrieve billions of data acquisitions per day, from across the complete CERN accelerator comple...

Descripción completa

Detalles Bibliográficos
Autor principal: Cakaric, Faris
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:http://cds.cern.ch/record/2046075
_version_ 1780947926475866112
author Cakaric, Faris
author_facet Cakaric, Faris
author_sort Cakaric, Faris
collection CERN
description During the previous 10 years, the CERN Accelerator Logging Service has evolved multiple times i.e. from the expected 1TB of data per year in the beginning to more than 50TB/year. It is used to store and retrieve billions of data acquisitions per day, from across the complete CERN accelerator complex, related subsystems, and experiments. This report includes a description of possible ways of improving the speed of the data retrieval from the CALS service. A short overview of the the possible technologies that can be used i.e. Apache Flume and Apache Spark is given, describing the possible ways in which these two technologies can be implemented.
id cern-2046075
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling cern-20460752019-09-30T06:29:59Zhttp://cds.cern.ch/record/2046075engCakaric, FarisA New Approach to Querying and Processing Data from CERN Accelerator Logging ServiceComputing and ComputersDuring the previous 10 years, the CERN Accelerator Logging Service has evolved multiple times i.e. from the expected 1TB of data per year in the beginning to more than 50TB/year. It is used to store and retrieve billions of data acquisitions per day, from across the complete CERN accelerator complex, related subsystems, and experiments. This report includes a description of possible ways of improving the speed of the data retrieval from the CALS service. A short overview of the the possible technologies that can be used i.e. Apache Flume and Apache Spark is given, describing the possible ways in which these two technologies can be implemented.CERN-STUDENTS-Note-2015-087oai:cds.cern.ch:20460752015-08-21
spellingShingle Computing and Computers
Cakaric, Faris
A New Approach to Querying and Processing Data from CERN Accelerator Logging Service
title A New Approach to Querying and Processing Data from CERN Accelerator Logging Service
title_full A New Approach to Querying and Processing Data from CERN Accelerator Logging Service
title_fullStr A New Approach to Querying and Processing Data from CERN Accelerator Logging Service
title_full_unstemmed A New Approach to Querying and Processing Data from CERN Accelerator Logging Service
title_short A New Approach to Querying and Processing Data from CERN Accelerator Logging Service
title_sort new approach to querying and processing data from cern accelerator logging service
topic Computing and Computers
url http://cds.cern.ch/record/2046075
work_keys_str_mv AT cakaricfaris anewapproachtoqueryingandprocessingdatafromcernacceleratorloggingservice
AT cakaricfaris newapproachtoqueryingandprocessingdatafromcernacceleratorloggingservice