Cargando…
Top-level DB design for Big Data in ATLAS Experiment at CERN
This presentation describes a system that accumulates a set of key quantities for a very large number of particle collision events recorded by the ATLAS experiment at the LHC (Large Hadron Collider) at CERN. The main project requirements are the handling of tens of billions of rows per year with min...
Autores principales: | , , |
---|---|
Lenguaje: | eng |
Publicado: |
2017
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2292857 |
_version_ | 1780956497317986304 |
---|---|
author | Dimitrov, Gancho Gallas, Elizabeth Vasileva, Petya Tsvetanova |
author_facet | Dimitrov, Gancho Gallas, Elizabeth Vasileva, Petya Tsvetanova |
author_sort | Dimitrov, Gancho |
collection | CERN |
description | This presentation describes a system that accumulates a set of key quantities for a very large number of particle collision events recorded by the ATLAS experiment at the LHC (Large Hadron Collider) at CERN. The main project requirements are the handling of tens of billions of rows per year with minimal DB resources, and providing outstanding performance for the fundamental use cases. Various challenges were faced in the process of project development, such as large data volume, large transactions (tens to hundreds of million of rows per transaction) requiring significant amount of undo, row duplication checks, adequate table statistics gathering, and SQL execution plan stability. Currently the system hosts about 120 billion rows as the data ingestion rate has gone beyond the initially foreseen 30 billion rows per year. The crucial DB schema design decisions and the Oracle DB features and techniques will be shared with the audience. By attending this session you will learn how big physics data can be organized in a very efficient way in order to become small-sized physics data. |
id | cern-2292857 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2017 |
record_format | invenio |
spelling | cern-22928572019-09-30T06:29:59Zhttp://cds.cern.ch/record/2292857engDimitrov, GanchoGallas, ElizabethVasileva, Petya TsvetanovaTop-level DB design for Big Data in ATLAS Experiment at CERNParticle Physics - ExperimentThis presentation describes a system that accumulates a set of key quantities for a very large number of particle collision events recorded by the ATLAS experiment at the LHC (Large Hadron Collider) at CERN. The main project requirements are the handling of tens of billions of rows per year with minimal DB resources, and providing outstanding performance for the fundamental use cases. Various challenges were faced in the process of project development, such as large data volume, large transactions (tens to hundreds of million of rows per transaction) requiring significant amount of undo, row duplication checks, adequate table statistics gathering, and SQL execution plan stability. Currently the system hosts about 120 billion rows as the data ingestion rate has gone beyond the initially foreseen 30 billion rows per year. The crucial DB schema design decisions and the Oracle DB features and techniques will be shared with the audience. By attending this session you will learn how big physics data can be organized in a very efficient way in order to become small-sized physics data.ATL-SOFT-SLIDE-2017-955oai:cds.cern.ch:22928572017-11-14 |
spellingShingle | Particle Physics - Experiment Dimitrov, Gancho Gallas, Elizabeth Vasileva, Petya Tsvetanova Top-level DB design for Big Data in ATLAS Experiment at CERN |
title | Top-level DB design for Big Data in ATLAS Experiment at CERN |
title_full | Top-level DB design for Big Data in ATLAS Experiment at CERN |
title_fullStr | Top-level DB design for Big Data in ATLAS Experiment at CERN |
title_full_unstemmed | Top-level DB design for Big Data in ATLAS Experiment at CERN |
title_short | Top-level DB design for Big Data in ATLAS Experiment at CERN |
title_sort | top-level db design for big data in atlas experiment at cern |
topic | Particle Physics - Experiment |
url | http://cds.cern.ch/record/2292857 |
work_keys_str_mv | AT dimitrovgancho topleveldbdesignforbigdatainatlasexperimentatcern AT gallaselizabeth topleveldbdesignforbigdatainatlasexperimentatcern AT vasilevapetyatsvetanova topleveldbdesignforbigdatainatlasexperimentatcern |