Cargando…

An Information Aggregation and Analytics System for ATLAS Frontier

ATLAS event processing requires access to centralized database systems where information about calibrations, detector status and data-taking conditions are stored. This processing is done on more than 150 computing sites on a world-wide computing grid which are able to access the database using the...

Descripción completa

Detalles Bibliográficos
Autores principales: Formica, Andrea, Ozturk, Nurcan, Si Amer, Millissa, Lozano Bahilo, Jose Julio, Vukotic, Ilija, Gallas, Elizabeth
Lenguaje:eng
Publicado: 2020
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/202024504032
http://cds.cern.ch/record/2712916
_version_ 1780965319020380160
author Formica, Andrea
Ozturk, Nurcan
Si Amer, Millissa
Lozano Bahilo, Jose Julio
Vukotic, Ilija
Gallas, Elizabeth
author_facet Formica, Andrea
Ozturk, Nurcan
Si Amer, Millissa
Lozano Bahilo, Jose Julio
Vukotic, Ilija
Gallas, Elizabeth
author_sort Formica, Andrea
collection CERN
description ATLAS event processing requires access to centralized database systems where information about calibrations, detector status and data-taking conditions are stored. This processing is done on more than 150 computing sites on a world-wide computing grid which are able to access the database using the squid-Frontier system. Some processing workflows have been found which overload the Frontier system due to the Conditions data model currently in use, specifically because some of the Conditions data requests have been found to have a low caching efficiency. The underlying cause is that non-identical requests as far as the caching are actually retrieving a much smaller number of unique payloads. While ATLAS is undertaking an adiabatic transition during Long Shutdown 2 and Run 3 from the current COOL Conditions data model to a new data model called CREST for Run 4, it is important to identify the problematic Conditions queries with low caching efficiency and work with the detector subsystems to improve the storage of such data within the current data model. For this purpose ATLAS put together an information aggregation and analytics system. The system is based on aggregated data from the squid-Frontier logs using the Elasticsearch technology. This talk describes the components of this analytics system from the server based on Flask/Celery application to the user interface and how we use Spark SQL functionalities to filter data for making plots, storing the caching efficiency results into a Elasticsearch database and finally deploying the package via a Docker container.
id cern-2712916
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
record_format invenio
spelling cern-27129162022-10-20T21:05:02Zdoi:10.1051/epjconf/202024504032http://cds.cern.ch/record/2712916engFormica, AndreaOzturk, NurcanSi Amer, MillissaLozano Bahilo, Jose JulioVukotic, IlijaGallas, ElizabethAn Information Aggregation and Analytics System for ATLAS FrontierParticle Physics - ExperimentATLAS event processing requires access to centralized database systems where information about calibrations, detector status and data-taking conditions are stored. This processing is done on more than 150 computing sites on a world-wide computing grid which are able to access the database using the squid-Frontier system. Some processing workflows have been found which overload the Frontier system due to the Conditions data model currently in use, specifically because some of the Conditions data requests have been found to have a low caching efficiency. The underlying cause is that non-identical requests as far as the caching are actually retrieving a much smaller number of unique payloads. While ATLAS is undertaking an adiabatic transition during Long Shutdown 2 and Run 3 from the current COOL Conditions data model to a new data model called CREST for Run 4, it is important to identify the problematic Conditions queries with low caching efficiency and work with the detector subsystems to improve the storage of such data within the current data model. For this purpose ATLAS put together an information aggregation and analytics system. The system is based on aggregated data from the squid-Frontier logs using the Elasticsearch technology. This talk describes the components of this analytics system from the server based on Flask/Celery application to the user interface and how we use Spark SQL functionalities to filter data for making plots, storing the caching efficiency results into a Elasticsearch database and finally deploying the package via a Docker container.ATL-SOFT-PROC-2020-026oai:cds.cern.ch:27129162020-03-13
spellingShingle Particle Physics - Experiment
Formica, Andrea
Ozturk, Nurcan
Si Amer, Millissa
Lozano Bahilo, Jose Julio
Vukotic, Ilija
Gallas, Elizabeth
An Information Aggregation and Analytics System for ATLAS Frontier
title An Information Aggregation and Analytics System for ATLAS Frontier
title_full An Information Aggregation and Analytics System for ATLAS Frontier
title_fullStr An Information Aggregation and Analytics System for ATLAS Frontier
title_full_unstemmed An Information Aggregation and Analytics System for ATLAS Frontier
title_short An Information Aggregation and Analytics System for ATLAS Frontier
title_sort information aggregation and analytics system for atlas frontier
topic Particle Physics - Experiment
url https://dx.doi.org/10.1051/epjconf/202024504032
http://cds.cern.ch/record/2712916
work_keys_str_mv AT formicaandrea aninformationaggregationandanalyticssystemforatlasfrontier
AT ozturknurcan aninformationaggregationandanalyticssystemforatlasfrontier
AT siamermillissa aninformationaggregationandanalyticssystemforatlasfrontier
AT lozanobahilojosejulio aninformationaggregationandanalyticssystemforatlasfrontier
AT vukoticilija aninformationaggregationandanalyticssystemforatlasfrontier
AT gallaselizabeth aninformationaggregationandanalyticssystemforatlasfrontier
AT formicaandrea informationaggregationandanalyticssystemforatlasfrontier
AT ozturknurcan informationaggregationandanalyticssystemforatlasfrontier
AT siamermillissa informationaggregationandanalyticssystemforatlasfrontier
AT lozanobahilojosejulio informationaggregationandanalyticssystemforatlasfrontier
AT vukoticilija informationaggregationandanalyticssystemforatlasfrontier
AT gallaselizabeth informationaggregationandanalyticssystemforatlasfrontier