Cargando…
Structured storage in ATLAS Distributed Data Management: use cases and experiences
The distributed data management system of the high-energy physics experiment ATLAS has a critical dependency on the Oracle Relational Database Management System. Recently however, the increased appearance of data warehouse-like workload in the experiment has put considerable and increasing strain on...
Autores principales: | , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2012
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/396/5/052045 http://cds.cern.ch/record/1456345 |
_version_ | 1780925085688791040 |
---|---|
author | Lassnig, M Garonne, V Molfetas, A Beermann, T Dimitrov, G Canali, L Zang, D Chinzer, LA |
author_facet | Lassnig, M Garonne, V Molfetas, A Beermann, T Dimitrov, G Canali, L Zang, D Chinzer, LA |
author_sort | Lassnig, M |
collection | CERN |
description | The distributed data management system of the high-energy physics experiment ATLAS has a critical dependency on the Oracle Relational Database Management System. Recently however, the increased appearance of data warehouse-like workload in the experiment has put considerable and increasing strain on the Oracle database. In particular, the analysis of archived data, and the aggregation of data for summary purposes has been especially demanding. For this reason, structured storage systems were evaluated to offload the Oracle database, and to handle processing of data in a non-transactional way. This includes distributed file systems like HDFS that support parallel execution of computational tasks on distributed data, as well as non-relational databases like HBase, Cassandra, or MongoDB. In this paper, the most important analysis and aggregation use cases of the data management system are presented, and how structured storage systems were established to process them. |
id | cern-1456345 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2012 |
record_format | invenio |
spelling | cern-14563452019-09-30T06:29:59Zdoi:10.1088/1742-6596/396/5/052045http://cds.cern.ch/record/1456345engLassnig, MGaronne, VMolfetas, ABeermann, TDimitrov, GCanali, LZang, DChinzer, LAStructured storage in ATLAS Distributed Data Management: use cases and experiencesComputing and ComputersThe distributed data management system of the high-energy physics experiment ATLAS has a critical dependency on the Oracle Relational Database Management System. Recently however, the increased appearance of data warehouse-like workload in the experiment has put considerable and increasing strain on the Oracle database. In particular, the analysis of archived data, and the aggregation of data for summary purposes has been especially demanding. For this reason, structured storage systems were evaluated to offload the Oracle database, and to handle processing of data in a non-transactional way. This includes distributed file systems like HDFS that support parallel execution of computational tasks on distributed data, as well as non-relational databases like HBase, Cassandra, or MongoDB. In this paper, the most important analysis and aggregation use cases of the data management system are presented, and how structured storage systems were established to process them.ATL-SOFT-PROC-2012-051oai:cds.cern.ch:14563452012-06-18 |
spellingShingle | Computing and Computers Lassnig, M Garonne, V Molfetas, A Beermann, T Dimitrov, G Canali, L Zang, D Chinzer, LA Structured storage in ATLAS Distributed Data Management: use cases and experiences |
title | Structured storage in ATLAS Distributed Data Management: use cases and experiences |
title_full | Structured storage in ATLAS Distributed Data Management: use cases and experiences |
title_fullStr | Structured storage in ATLAS Distributed Data Management: use cases and experiences |
title_full_unstemmed | Structured storage in ATLAS Distributed Data Management: use cases and experiences |
title_short | Structured storage in ATLAS Distributed Data Management: use cases and experiences |
title_sort | structured storage in atlas distributed data management: use cases and experiences |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/396/5/052045 http://cds.cern.ch/record/1456345 |
work_keys_str_mv | AT lassnigm structuredstorageinatlasdistributeddatamanagementusecasesandexperiences AT garonnev structuredstorageinatlasdistributeddatamanagementusecasesandexperiences AT molfetasa structuredstorageinatlasdistributeddatamanagementusecasesandexperiences AT beermannt structuredstorageinatlasdistributeddatamanagementusecasesandexperiences AT dimitrovg structuredstorageinatlasdistributeddatamanagementusecasesandexperiences AT canalil structuredstorageinatlasdistributeddatamanagementusecasesandexperiences AT zangd structuredstorageinatlasdistributeddatamanagementusecasesandexperiences AT chinzerla structuredstorageinatlasdistributeddatamanagementusecasesandexperiences |