Cargando…
An Information Aggregation and Analytics System for ATLAS Frontier.
ATLAS event processing requires access to centralized database systems where information about calibrations, detector status and data-taking conditions are stored. This processing is done on more than 150 computing sites on a world-wide computing grid which are able to access the database using the...
Autores principales: | , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2019
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2694040 |
_version_ | 1780964064739983360 |
---|---|
author | Formica, Andrea Ozturk, Nurcan Gallas, Elizabeth Vukotic, Ilija Lozano Bahilo, Jose Julio Si Amer, Millissa |
author_facet | Formica, Andrea Ozturk, Nurcan Gallas, Elizabeth Vukotic, Ilija Lozano Bahilo, Jose Julio Si Amer, Millissa |
author_sort | Formica, Andrea |
collection | CERN |
description | ATLAS event processing requires access to centralized database systems where information about calibrations, detector status and data-taking conditions are stored. This processing is done on more than 150 computing sites on a world-wide computing grid which are able to access the database using the squid-Frontier system. Some processing workflows have been found which overload the Frontier system due to the Conditions data model currently in use, specifically because some of the Conditions data requests have been found to have a low caching efficiency. The underlying cause is that non-identical requests as far as the caching are actually retrieving a much smaller number of unique payloads. While ATLAS is undertaking an adiabatic transition during LS2 and Run-3 from the current COOL Conditions data model to a new data model called CREST for Run 4, it is important to identify the problematic Conditions queries with low caching efficiency and work with the detector subsystems to improve the storage of such data within the current data model. For this purpose ATLAS put together an information aggregation and analytics system. The system is based on aggregated data from the squid-Frontier logs using the Elastic Search technology. This talk describes the components of this analytics system from the server based on Flask/Celery application to the user interface and how we use Spark SQL functionalities to filter data for making plots, storing the caching efficiency results into a PostgreSQL database and finally deploying the package via a Docker container. |
id | cern-2694040 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2019 |
record_format | invenio |
spelling | cern-26940402019-10-29T10:08:38Zhttp://cds.cern.ch/record/2694040engFormica, AndreaOzturk, NurcanGallas, ElizabethVukotic, IlijaLozano Bahilo, Jose JulioSi Amer, MillissaAn Information Aggregation and Analytics System for ATLAS Frontier.Particle Physics - ExperimentATLAS event processing requires access to centralized database systems where information about calibrations, detector status and data-taking conditions are stored. This processing is done on more than 150 computing sites on a world-wide computing grid which are able to access the database using the squid-Frontier system. Some processing workflows have been found which overload the Frontier system due to the Conditions data model currently in use, specifically because some of the Conditions data requests have been found to have a low caching efficiency. The underlying cause is that non-identical requests as far as the caching are actually retrieving a much smaller number of unique payloads. While ATLAS is undertaking an adiabatic transition during LS2 and Run-3 from the current COOL Conditions data model to a new data model called CREST for Run 4, it is important to identify the problematic Conditions queries with low caching efficiency and work with the detector subsystems to improve the storage of such data within the current data model. For this purpose ATLAS put together an information aggregation and analytics system. The system is based on aggregated data from the squid-Frontier logs using the Elastic Search technology. This talk describes the components of this analytics system from the server based on Flask/Celery application to the user interface and how we use Spark SQL functionalities to filter data for making plots, storing the caching efficiency results into a PostgreSQL database and finally deploying the package via a Docker container.ATL-SOFT-SLIDE-2019-778oai:cds.cern.ch:26940402019-10-18 |
spellingShingle | Particle Physics - Experiment Formica, Andrea Ozturk, Nurcan Gallas, Elizabeth Vukotic, Ilija Lozano Bahilo, Jose Julio Si Amer, Millissa An Information Aggregation and Analytics System for ATLAS Frontier. |
title | An Information Aggregation and Analytics System for ATLAS Frontier. |
title_full | An Information Aggregation and Analytics System for ATLAS Frontier. |
title_fullStr | An Information Aggregation and Analytics System for ATLAS Frontier. |
title_full_unstemmed | An Information Aggregation and Analytics System for ATLAS Frontier. |
title_short | An Information Aggregation and Analytics System for ATLAS Frontier. |
title_sort | information aggregation and analytics system for atlas frontier. |
topic | Particle Physics - Experiment |
url | http://cds.cern.ch/record/2694040 |
work_keys_str_mv | AT formicaandrea aninformationaggregationandanalyticssystemforatlasfrontier AT ozturknurcan aninformationaggregationandanalyticssystemforatlasfrontier AT gallaselizabeth aninformationaggregationandanalyticssystemforatlasfrontier AT vukoticilija aninformationaggregationandanalyticssystemforatlasfrontier AT lozanobahilojosejulio aninformationaggregationandanalyticssystemforatlasfrontier AT siamermillissa aninformationaggregationandanalyticssystemforatlasfrontier AT formicaandrea informationaggregationandanalyticssystemforatlasfrontier AT ozturknurcan informationaggregationandanalyticssystemforatlasfrontier AT gallaselizabeth informationaggregationandanalyticssystemforatlasfrontier AT vukoticilija informationaggregationandanalyticssystemforatlasfrontier AT lozanobahilojosejulio informationaggregationandanalyticssystemforatlasfrontier AT siamermillissa informationaggregationandanalyticssystemforatlasfrontier |