Cargando…

The BigPanDA Monitoring System Architecture

Currently-running large-scale scientific projects involve unprecedented amounts of data and computing power. For example, the ATLAS experiment at the Large Hadron Collider (LHC) has collected 140 PB of data over the course of Run 1 and this value increases at rate of ~800MB/s during the ongoing Run...

Descripción completa

Detalles Bibliográficos
Autores principales: Korchuganova, Tatiana, Padolski, Siarhei, Wenaus, Torre, Klimentov, Alexei, Alekseev, Aleksandr
Lenguaje:eng
Publicado: 2018
Materias:
Acceso en línea:http://cds.cern.ch/record/2637585
_version_ 1780959941420384256
author Korchuganova, Tatiana
Padolski, Siarhei
Wenaus, Torre
Klimentov, Alexei
Alekseev, Aleksandr
author_facet Korchuganova, Tatiana
Padolski, Siarhei
Wenaus, Torre
Klimentov, Alexei
Alekseev, Aleksandr
author_sort Korchuganova, Tatiana
collection CERN
description Currently-running large-scale scientific projects involve unprecedented amounts of data and computing power. For example, the ATLAS experiment at the Large Hadron Collider (LHC) has collected 140 PB of data over the course of Run 1 and this value increases at rate of ~800MB/s during the ongoing Run 2. Processing and analysis of such amounts of data demands development of complex operational workflow and payload systems along with building top edge computing facilities. In the ATLAS experiment a key element of the payload management is the Production and Distributed Analysis system (PanDA). It consists of several core components and one of them is the monitoring. The latter is responsible for providing a comprehensive and coherent view of the tasks and jobs executed by the system, from high level summaries to detailed drill-down job diagnostics. The BigPanDA monitoring has been in production since the middle of 2014 and it continuously evolves to satisfy increasing demands in functionality and growing payload scales. Today it effectively keeps track of more than 2 million jobs per day distributed over 170 computing centers worldwide in the largest instance of the BigPanDA monitoring: the ATLAS experiment. In this paper we describe the monitoring architecture and its principal features.
id cern-2637585
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2018
record_format invenio
spelling cern-26375852019-09-30T06:29:59Zhttp://cds.cern.ch/record/2637585engKorchuganova, TatianaPadolski, SiarheiWenaus, TorreKlimentov, AlexeiAlekseev, AleksandrThe BigPanDA Monitoring System ArchitectureParticle Physics - ExperimentCurrently-running large-scale scientific projects involve unprecedented amounts of data and computing power. For example, the ATLAS experiment at the Large Hadron Collider (LHC) has collected 140 PB of data over the course of Run 1 and this value increases at rate of ~800MB/s during the ongoing Run 2. Processing and analysis of such amounts of data demands development of complex operational workflow and payload systems along with building top edge computing facilities. In the ATLAS experiment a key element of the payload management is the Production and Distributed Analysis system (PanDA). It consists of several core components and one of them is the monitoring. The latter is responsible for providing a comprehensive and coherent view of the tasks and jobs executed by the system, from high level summaries to detailed drill-down job diagnostics. The BigPanDA monitoring has been in production since the middle of 2014 and it continuously evolves to satisfy increasing demands in functionality and growing payload scales. Today it effectively keeps track of more than 2 million jobs per day distributed over 170 computing centers worldwide in the largest instance of the BigPanDA monitoring: the ATLAS experiment. In this paper we describe the monitoring architecture and its principal features.ATL-SOFT-SLIDE-2018-695oai:cds.cern.ch:26375852018-09-08
spellingShingle Particle Physics - Experiment
Korchuganova, Tatiana
Padolski, Siarhei
Wenaus, Torre
Klimentov, Alexei
Alekseev, Aleksandr
The BigPanDA Monitoring System Architecture
title The BigPanDA Monitoring System Architecture
title_full The BigPanDA Monitoring System Architecture
title_fullStr The BigPanDA Monitoring System Architecture
title_full_unstemmed The BigPanDA Monitoring System Architecture
title_short The BigPanDA Monitoring System Architecture
title_sort bigpanda monitoring system architecture
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2637585
work_keys_str_mv AT korchuganovatatiana thebigpandamonitoringsystemarchitecture
AT padolskisiarhei thebigpandamonitoringsystemarchitecture
AT wenaustorre thebigpandamonitoringsystemarchitecture
AT klimentovalexei thebigpandamonitoringsystemarchitecture
AT alekseevaleksandr thebigpandamonitoringsystemarchitecture
AT korchuganovatatiana bigpandamonitoringsystemarchitecture
AT padolskisiarhei bigpandamonitoringsystemarchitecture
AT wenaustorre bigpandamonitoringsystemarchitecture
AT klimentovalexei bigpandamonitoringsystemarchitecture
AT alekseevaleksandr bigpandamonitoringsystemarchitecture