
Big data analytics as a service infrastructure: challenges, desired properties and solutions

CERN's accelerator complex generates a very large amount of data. A large volume of heterogeneous data is constantly generated from control equipment and monitoring agents. These data must be stored and analysed. Over the decades, CERN's research and engineering teams have applied diff...


Bibliographic Details
Main author: Martín-Márquez, Manuel
Language: eng
Published: 2015
Subjects: Computing and Computers
Online access: https://dx.doi.org/10.1088/1742-6596/664/4/042034
http://cds.cern.ch/record/2134570
_version_ 1780949906871025664
author Martín-Márquez, Manuel
author_facet Martín-Márquez, Manuel
author_sort Martín-Márquez, Manuel
collection CERN
description CERN's accelerator complex generates a very large amount of data. A large volume of heterogeneous data is constantly generated from control equipment and monitoring agents. These data must be stored and analysed. Over the decades, CERN's research and engineering teams have applied different approaches, techniques and technologies for this purpose. This fragmentation has limited the necessary collaboration and, more importantly, cross-domain data analytics. These two factors are essential to unlock hidden insights and correlations between the underlying processes, which enable better and more efficient day-to-day accelerator operations and more informed decisions. The proposed Big Data Analytics as a Service Infrastructure aims to: (1) integrate the existing developments, (2) centralise and standardise the complex data analytics needs of CERN's research and engineering community, (3) deliver real-time and batch data analytics and information discovery capabilities, and (4) provide transparent access and Extract, Transform and Load (ETL) mechanisms to the various mission-critical existing data repositories. This paper presents the desired objectives and properties resulting from the analysis of CERN's data analytics requirements, the main challenges (technological, collaborative and educational), and potential solutions.
id oai-inspirehep.net-1413865
institution European Organization for Nuclear Research
language eng
publishDate 2015
record_format invenio
spellingShingle Computing and Computers
Martín-Márquez, Manuel
Big data analytics as a service infrastructure: challenges, desired properties and solutions
title Big data analytics as a service infrastructure: challenges, desired properties and solutions
title_full Big data analytics as a service infrastructure: challenges, desired properties and solutions
title_fullStr Big data analytics as a service infrastructure: challenges, desired properties and solutions
title_full_unstemmed Big data analytics as a service infrastructure: challenges, desired properties and solutions
title_short Big data analytics as a service infrastructure: challenges, desired properties and solutions
title_sort big data analytics as a service infrastructure: challenges, desired properties and solutions
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/664/4/042034
http://cds.cern.ch/record/2134570
work_keys_str_mv AT martinmarquezmanuel bigdataanalyticsasaserviceinfrastructurechallengesdesiredpropertiesandsolutions