Cargando…

Structured storage in ATLAS Distributed Data Management: use cases and experiences

The distributed data management system of the high-energy physics experiment ATLAS has a critical dependency on the Oracle Relational Database Management System. Recently however, the increased appearance of data warehouse-like workload in the experiment has put considerable and increasing strain on...

Descripción completa

Detalles Bibliográficos
Autores principales: Lassnig, M, Garonne, V, Molfetas, A, Beermann, T, Dimitrov, G, Canali, L, Zang, D, Chinzer, LA
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/396/5/052045
http://cds.cern.ch/record/1456345
_version_ 1780925085688791040
author Lassnig, M
Garonne, V
Molfetas, A
Beermann, T
Dimitrov, G
Canali, L
Zang, D
Chinzer, LA
author_facet Lassnig, M
Garonne, V
Molfetas, A
Beermann, T
Dimitrov, G
Canali, L
Zang, D
Chinzer, LA
author_sort Lassnig, M
collection CERN
description The distributed data management system of the high-energy physics experiment ATLAS has a critical dependency on the Oracle Relational Database Management System. Recently however, the increased appearance of data warehouse-like workload in the experiment has put considerable and increasing strain on the Oracle database. In particular, the analysis of archived data, and the aggregation of data for summary purposes has been especially demanding. For this reason, structured storage systems were evaluated to offload the Oracle database, and to handle processing of data in a non-transactional way. This includes distributed file systems like HDFS that support parallel execution of computational tasks on distributed data, as well as non-relational databases like HBase, Cassandra, or MongoDB. In this paper, the most important analysis and aggregation use cases of the data management system are presented, and how structured storage systems were established to process them.
id cern-1456345
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling cern-14563452019-09-30T06:29:59Zdoi:10.1088/1742-6596/396/5/052045http://cds.cern.ch/record/1456345engLassnig, MGaronne, VMolfetas, ABeermann, TDimitrov, GCanali, LZang, DChinzer, LAStructured storage in ATLAS Distributed Data Management: use cases and experiencesComputing and ComputersThe distributed data management system of the high-energy physics experiment ATLAS has a critical dependency on the Oracle Relational Database Management System. Recently however, the increased appearance of data warehouse-like workload in the experiment has put considerable and increasing strain on the Oracle database. In particular, the analysis of archived data, and the aggregation of data for summary purposes has been especially demanding. For this reason, structured storage systems were evaluated to offload the Oracle database, and to handle processing of data in a non-transactional way. This includes distributed file systems like HDFS that support parallel execution of computational tasks on distributed data, as well as non-relational databases like HBase, Cassandra, or MongoDB. In this paper, the most important analysis and aggregation use cases of the data management system are presented, and how structured storage systems were established to process them.ATL-SOFT-PROC-2012-051oai:cds.cern.ch:14563452012-06-18
spellingShingle Computing and Computers
Lassnig, M
Garonne, V
Molfetas, A
Beermann, T
Dimitrov, G
Canali, L
Zang, D
Chinzer, LA
Structured storage in ATLAS Distributed Data Management: use cases and experiences
title Structured storage in ATLAS Distributed Data Management: use cases and experiences
title_full Structured storage in ATLAS Distributed Data Management: use cases and experiences
title_fullStr Structured storage in ATLAS Distributed Data Management: use cases and experiences
title_full_unstemmed Structured storage in ATLAS Distributed Data Management: use cases and experiences
title_short Structured storage in ATLAS Distributed Data Management: use cases and experiences
title_sort structured storage in atlas distributed data management: use cases and experiences
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/396/5/052045
http://cds.cern.ch/record/1456345
work_keys_str_mv AT lassnigm structuredstorageinatlasdistributeddatamanagementusecasesandexperiences
AT garonnev structuredstorageinatlasdistributeddatamanagementusecasesandexperiences
AT molfetasa structuredstorageinatlasdistributeddatamanagementusecasesandexperiences
AT beermannt structuredstorageinatlasdistributeddatamanagementusecasesandexperiences
AT dimitrovg structuredstorageinatlasdistributeddatamanagementusecasesandexperiences
AT canalil structuredstorageinatlasdistributeddatamanagementusecasesandexperiences
AT zangd structuredstorageinatlasdistributeddatamanagementusecasesandexperiences
AT chinzerla structuredstorageinatlasdistributeddatamanagementusecasesandexperiences