An Oracle-based event index for ATLAS
The ATLAS EventIndex System has amassed a set of key quantities for a large number of ATLAS events into a Hadoop-based infrastructure for the purpose of providing the experiment with a number of event-wise services. Collecting this data in one place provides the opportunity to investigate various st...
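Main authors: | Gallas, Elizabeth; Dimitrov, Gancho; Petrova, Petya Tsvetanova; Baranowski, Zbigniew; Canali, Luca; Dumitru, Andrei; Formica, Andrea |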
---|---|
Language: | eng |
Published: | 2017 |
Subjects: | Particle Physics - Experiment |
Online access: | https://dx.doi.org/10.1088/1742-6596/898/4/042033 http://cds.cern.ch/record/2252389 |
_version_ | 1780953509036818432 |
---|---|
author | Gallas, Elizabeth Dimitrov, Gancho Petrova, Petya Tsvetanova Baranowski, Zbigniew Canali, Luca Dumitru, Andrei Formica, Andrea |
author_facet | Gallas, Elizabeth Dimitrov, Gancho Petrova, Petya Tsvetanova Baranowski, Zbigniew Canali, Luca Dumitru, Andrei Formica, Andrea |
author_sort | Gallas, Elizabeth |
collection | CERN |
description | The ATLAS EventIndex System has amassed a set of key quantities for a large number of ATLAS events into a Hadoop-based infrastructure for the purpose of providing the experiment with a number of event-wise services. Collecting this data in one place provides the opportunity to investigate various storage formats and technologies, assess which best serve the various use cases, and consider what other benefits alternative storage systems provide. In this presentation we describe how the data are imported into an Oracle RDBMS (relational database management system), the services we have built based on this architecture, and our experience with it. We’ve indexed about 26 billion real data events thus far and have designed the system to accommodate future data, with expected rates of 5 and 20 billion events per year. We have found this system offers outstanding performance for some fundamental use cases. In addition, profiting from the co-location of this data with other complementary metadata in ATLAS, the system has been easily extended to perform essential assessments of data integrity and completeness and to identify event duplication, including at what step in processing the duplication occurred. |
id | cern-2252389 |
institution | European Organization for Nuclear Research |
language | eng |
publishDate | 2017 |
record_format | invenio |
spelling | cern-2252389 2019-10-15T15:19:10Z doi:10.1088/1742-6596/898/4/042033 http://cds.cern.ch/record/2252389 eng Gallas, Elizabeth Dimitrov, Gancho Petrova, Petya Tsvetanova Baranowski, Zbigniew Canali, Luca Dumitru, Andrei Formica, Andrea An Oracle-based event index for ATLAS Particle Physics - Experiment The ATLAS EventIndex System has amassed a set of key quantities for a large number of ATLAS events into a Hadoop-based infrastructure for the purpose of providing the experiment with a number of event-wise services. Collecting this data in one place provides the opportunity to investigate various storage formats and technologies, assess which best serve the various use cases, and consider what other benefits alternative storage systems provide. In this presentation we describe how the data are imported into an Oracle RDBMS (relational database management system), the services we have built based on this architecture, and our experience with it. We’ve indexed about 26 billion real data events thus far and have designed the system to accommodate future data, with expected rates of 5 and 20 billion events per year. We have found this system offers outstanding performance for some fundamental use cases. In addition, profiting from the co-location of this data with other complementary metadata in ATLAS, the system has been easily extended to perform essential assessments of data integrity and completeness and to identify event duplication, including at what step in processing the duplication occurred. ATL-SOFT-PROC-2017-048 oai:cds.cern.ch:2252389 2017-02-14 |
spellingShingle | Particle Physics - Experiment Gallas, Elizabeth Dimitrov, Gancho Petrova, Petya Tsvetanova Baranowski, Zbigniew Canali, Luca Dumitru, Andrei Formica, Andrea An Oracle-based event index for ATLAS |
title | An Oracle-based event index for ATLAS |
title_full | An Oracle-based event index for ATLAS |
title_fullStr | An Oracle-based event index for ATLAS |
title_full_unstemmed | An Oracle-based event index for ATLAS |
title_short | An Oracle-based event index for ATLAS |
title_sort | oracle-based event index for atlas |
topic | Particle Physics - Experiment |
url | https://dx.doi.org/10.1088/1742-6596/898/4/042033 http://cds.cern.ch/record/2252389 |
work_keys_str_mv | AT gallaselizabeth anoraclebasedeventindexforatlas AT dimitrovgancho anoraclebasedeventindexforatlas AT petrovapetyatsvetanova anoraclebasedeventindexforatlas AT baranowskizbigniew anoraclebasedeventindexforatlas AT canaliluca anoraclebasedeventindexforatlas AT dumitruandrei anoraclebasedeventindexforatlas AT formicaandrea anoraclebasedeventindexforatlas AT gallaselizabeth oraclebasedeventindexforatlas AT dimitrovgancho oraclebasedeventindexforatlas AT petrovapetyatsvetanova oraclebasedeventindexforatlas AT baranowskizbigniew oraclebasedeventindexforatlas AT canaliluca oraclebasedeventindexforatlas AT dumitruandrei oraclebasedeventindexforatlas AT formicaandrea oraclebasedeventindexforatlas |