Cargando…

Top-level DB design for Big Data in ATLAS Experiment at CERN

This presentation describes a system that accumulates a set of key quantities for a very large number of particle collision events recorded by the ATLAS experiment at the LHC (Large Hadron Collider) at CERN. The main project requirements are the handling of tens of billions of rows per year with min...

Descripción completa

Detalles Bibliográficos
Autores principales: Dimitrov, Gancho, Gallas, Elizabeth, Vasileva, Petya Tsvetanova
Lenguaje:eng
Publicado: 2017
Materias:
Acceso en línea:http://cds.cern.ch/record/2292857
_version_ 1780956497317986304
author Dimitrov, Gancho
Gallas, Elizabeth
Vasileva, Petya Tsvetanova
author_facet Dimitrov, Gancho
Gallas, Elizabeth
Vasileva, Petya Tsvetanova
author_sort Dimitrov, Gancho
collection CERN
description This presentation describes a system that accumulates a set of key quantities for a very large number of particle collision events recorded by the ATLAS experiment at the LHC (Large Hadron Collider) at CERN. The main project requirements are the handling of tens of billions of rows per year with minimal DB resources, and providing outstanding performance for the fundamental use cases. Various challenges were faced in the process of project development, such as large data volume, large transactions (tens to hundreds of million of rows per transaction) requiring significant amount of undo, row duplication checks, adequate table statistics gathering, and SQL execution plan stability. Currently the system hosts about 120 billion rows as the data ingestion rate has gone beyond the initially foreseen 30 billion rows per year. The crucial DB schema design decisions and the Oracle DB features and techniques will be shared with the audience. By attending this session you will learn how big physics data can be organized in a very efficient way in order to become small-sized physics data.
id cern-2292857
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling cern-22928572019-09-30T06:29:59Zhttp://cds.cern.ch/record/2292857engDimitrov, GanchoGallas, ElizabethVasileva, Petya TsvetanovaTop-level DB design for Big Data in ATLAS Experiment at CERNParticle Physics - ExperimentThis presentation describes a system that accumulates a set of key quantities for a very large number of particle collision events recorded by the ATLAS experiment at the LHC (Large Hadron Collider) at CERN. The main project requirements are the handling of tens of billions of rows per year with minimal DB resources, and providing outstanding performance for the fundamental use cases. Various challenges were faced in the process of project development, such as large data volume, large transactions (tens to hundreds of million of rows per transaction) requiring significant amount of undo, row duplication checks, adequate table statistics gathering, and SQL execution plan stability. Currently the system hosts about 120 billion rows as the data ingestion rate has gone beyond the initially foreseen 30 billion rows per year. The crucial DB schema design decisions and the Oracle DB features and techniques will be shared with the audience. By attending this session you will learn how big physics data can be organized in a very efficient way in order to become small-sized physics data.ATL-SOFT-SLIDE-2017-955oai:cds.cern.ch:22928572017-11-14
spellingShingle Particle Physics - Experiment
Dimitrov, Gancho
Gallas, Elizabeth
Vasileva, Petya Tsvetanova
Top-level DB design for Big Data in ATLAS Experiment at CERN
title Top-level DB design for Big Data in ATLAS Experiment at CERN
title_full Top-level DB design for Big Data in ATLAS Experiment at CERN
title_fullStr Top-level DB design for Big Data in ATLAS Experiment at CERN
title_full_unstemmed Top-level DB design for Big Data in ATLAS Experiment at CERN
title_short Top-level DB design for Big Data in ATLAS Experiment at CERN
title_sort top-level db design for big data in atlas experiment at cern
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2292857
work_keys_str_mv AT dimitrovgancho topleveldbdesignforbigdatainatlasexperimentatcern
AT gallaselizabeth topleveldbdesignforbigdatainatlasexperimentatcern
AT vasilevapetyatsvetanova topleveldbdesignforbigdatainatlasexperimentatcern