Cargando…

Monitoring WLCG with lambda-architecture: a new scalable data store and analytics platform for monitoring at petabyte scale.

Monitoring the WLCG infrastructure requires the gathering and analysis of a high volume of heterogeneous data (e.g. data transfers, job monitoring, site tests) coming from different services and experiment-specific frameworks to provide a uniform and flexible interface for scientists and sites. The...

Descripción completa

Detalles Bibliográficos
Autores principales: Magnoni, L, Suthakar, U, Cordeiro, C, Georgiou, M, Andreeva, J, Khan, A, Smith, D R
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/664/5/052023
http://cds.cern.ch/record/2134588
_version_ 1780949910902800384
author Magnoni, L
Suthakar, U
Cordeiro, C
Georgiou, M
Andreeva, J
Khan, A
Smith, D R
author_facet Magnoni, L
Suthakar, U
Cordeiro, C
Georgiou, M
Andreeva, J
Khan, A
Smith, D R
author_sort Magnoni, L
collection CERN
description Monitoring the WLCG infrastructure requires the gathering and analysis of a high volume of heterogeneous data (e.g. data transfers, job monitoring, site tests) coming from different services and experiment-specific frameworks to provide a uniform and flexible interface for scientists and sites. The current architecture, where relational database systems are used to store, to process and to serve monitoring data, has limitations in coping with the foreseen increase in the volume (e.g. higher LHC luminosity) and the variety (e.g. new data-transfer protocols and new resource-types, as cloud-computing) of WLCG monitoring events. This paper presents a new scalable data store and analytics platform designed by the Support for Distributed Computing (SDC) group, at the CERN IT department, which uses a variety of technologies each one targeting specific aspects of big-scale distributed data-processing (commonly referred as lambda-architecture approach). Results of data processing on Hadoop for WLCG data activities monitoring are presented, showing how the new architecture can easily analyze hundreds of millions of transfer logs in a few minutes. Moreover, a comparison of data partitioning, compression and file format (e.g. CSV, Avro) is presented, with particular attention given to how the file structure impacts the overall MapReduce performance. In conclusion, the evolution of the current implementation, which focuses on data storage and batch processing, towards a complete lambda-architecture is discussed, with consideration of candidate technology for the serving layer (e.g. Elasticsearch) and a description of a proof of concept implementation, based on Apache Spark and Esper, for the real-time part which compensates for batch-processing latency and automates problem detection and failures.
id oai-inspirehep.net-1413905
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling oai-inspirehep.net-14139052022-08-10T13:00:57Zdoi:10.1088/1742-6596/664/5/052023http://cds.cern.ch/record/2134588engMagnoni, LSuthakar, UCordeiro, CGeorgiou, MAndreeva, JKhan, ASmith, D RMonitoring WLCG with lambda-architecture: a new scalable data store and analytics platform for monitoring at petabyte scale.Computing and ComputersMonitoring the WLCG infrastructure requires the gathering and analysis of a high volume of heterogeneous data (e.g. data transfers, job monitoring, site tests) coming from different services and experiment-specific frameworks to provide a uniform and flexible interface for scientists and sites. The current architecture, where relational database systems are used to store, to process and to serve monitoring data, has limitations in coping with the foreseen increase in the volume (e.g. higher LHC luminosity) and the variety (e.g. new data-transfer protocols and new resource-types, as cloud-computing) of WLCG monitoring events. This paper presents a new scalable data store and analytics platform designed by the Support for Distributed Computing (SDC) group, at the CERN IT department, which uses a variety of technologies each one targeting specific aspects of big-scale distributed data-processing (commonly referred as lambda-architecture approach). Results of data processing on Hadoop for WLCG data activities monitoring are presented, showing how the new architecture can easily analyze hundreds of millions of transfer logs in a few minutes. Moreover, a comparison of data partitioning, compression and file format (e.g. CSV, Avro) is presented, with particular attention given to how the file structure impacts the overall MapReduce performance. In conclusion, the evolution of the current implementation, which focuses on data storage and batch processing, towards a complete lambda-architecture is discussed, with consideration of candidate technology for the serving layer (e.g. Elasticsearch) and a description of a proof of concept implementation, based on Apache Spark and Esper, for the real-time part which compensates for batch-processing latency and automates problem detection and failures.oai:inspirehep.net:14139052015
spellingShingle Computing and Computers
Magnoni, L
Suthakar, U
Cordeiro, C
Georgiou, M
Andreeva, J
Khan, A
Smith, D R
Monitoring WLCG with lambda-architecture: a new scalable data store and analytics platform for monitoring at petabyte scale.
title Monitoring WLCG with lambda-architecture: a new scalable data store and analytics platform for monitoring at petabyte scale.
title_full Monitoring WLCG with lambda-architecture: a new scalable data store and analytics platform for monitoring at petabyte scale.
title_fullStr Monitoring WLCG with lambda-architecture: a new scalable data store and analytics platform for monitoring at petabyte scale.
title_full_unstemmed Monitoring WLCG with lambda-architecture: a new scalable data store and analytics platform for monitoring at petabyte scale.
title_short Monitoring WLCG with lambda-architecture: a new scalable data store and analytics platform for monitoring at petabyte scale.
title_sort monitoring wlcg with lambda-architecture: a new scalable data store and analytics platform for monitoring at petabyte scale.
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/664/5/052023
http://cds.cern.ch/record/2134588
work_keys_str_mv AT magnonil monitoringwlcgwithlambdaarchitectureanewscalabledatastoreandanalyticsplatformformonitoringatpetabytescale
AT suthakaru monitoringwlcgwithlambdaarchitectureanewscalabledatastoreandanalyticsplatformformonitoringatpetabytescale
AT cordeiroc monitoringwlcgwithlambdaarchitectureanewscalabledatastoreandanalyticsplatformformonitoringatpetabytescale
AT georgioum monitoringwlcgwithlambdaarchitectureanewscalabledatastoreandanalyticsplatformformonitoringatpetabytescale
AT andreevaj monitoringwlcgwithlambdaarchitectureanewscalabledatastoreandanalyticsplatformformonitoringatpetabytescale
AT khana monitoringwlcgwithlambdaarchitectureanewscalabledatastoreandanalyticsplatformformonitoringatpetabytescale
AT smithdr monitoringwlcgwithlambdaarchitectureanewscalabledatastoreandanalyticsplatformformonitoringatpetabytescale