Cargando…
The ATLAS Distributed Data Management System & Databases
The ATLAS Distributed Data Management (DDM) System is responsible for the global management of petabytes of high energy physics data. The current system, DQ2, has a critical dependency on Relational Database Management Systems (RDBMS), like Oracle. RDBMS are well-suited to enforcing data integrity i...
Autores principales: | , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2013
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/1551545 |
_version_ | 1780930246802931712 |
---|---|
author | Garonne, V Lassnig, M Barisits, M Beermann, T Vigne, R Serfon, C |
author_facet | Garonne, V Lassnig, M Barisits, M Beermann, T Vigne, R Serfon, C |
author_sort | Garonne, V |
collection | CERN |
description | The ATLAS Distributed Data Management (DDM) System is responsible for the global management of petabytes of high energy physics data. The current system, DQ2, has a critical dependency on Relational Database Management Systems (RDBMS), like Oracle. RDBMS are well-suited to enforcing data integrity in online transaction processing applications, however, concerns have been raised about the scalability of its data warehouse-like workload. In particular, analysis of archived data or aggregation of transactional data for summary purposes is problematic. Therefore, we have evaluated new approaches to handle vast amounts of data. We have investigated a class of database technologies commonly referred to as NoSQL databases. This includes distributed filesystems, like HDFS, that support parallel execution of computational tasks on distributed data, as well as schema-less approaches via key-value stores, like HBase. In this talk we will describe our use cases in ATLAS, share our experiences with various databases used in production and present the database technologies envisaged for the next-generation DDM system, Rucio. Rucio is an evolution of the ATLAS DDM system which addresses the scalability issues observed in DQ2. |
id | cern-1551545 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2013 |
record_format | invenio |
spelling | cern-15515452019-09-30T06:29:59Zhttp://cds.cern.ch/record/1551545engGaronne, VLassnig, MBarisits, MBeermann, TVigne, RSerfon, CThe ATLAS Distributed Data Management System & DatabasesDetectors and Experimental TechniquesThe ATLAS Distributed Data Management (DDM) System is responsible for the global management of petabytes of high energy physics data. The current system, DQ2, has a critical dependency on Relational Database Management Systems (RDBMS), like Oracle. RDBMS are well-suited to enforcing data integrity in online transaction processing applications, however, concerns have been raised about the scalability of its data warehouse-like workload. In particular, analysis of archived data or aggregation of transactional data for summary purposes is problematic. Therefore, we have evaluated new approaches to handle vast amounts of data. We have investigated a class of database technologies commonly referred to as NoSQL databases. This includes distributed filesystems, like HDFS, that support parallel execution of computational tasks on distributed data, as well as schema-less approaches via key-value stores, like HBase. In this talk we will describe our use cases in ATLAS, share our experiences with various databases used in production and present the database technologies envisaged for the next-generation DDM system, Rucio. Rucio is an evolution of the ATLAS DDM system which addresses the scalability issues observed in DQ2.ATL-SOFT-SLIDE-2013-292oai:cds.cern.ch:15515452013-05-28 |
spellingShingle | Detectors and Experimental Techniques Garonne, V Lassnig, M Barisits, M Beermann, T Vigne, R Serfon, C The ATLAS Distributed Data Management System & Databases |
title | The ATLAS Distributed Data Management System & Databases |
title_full | The ATLAS Distributed Data Management System & Databases |
title_fullStr | The ATLAS Distributed Data Management System & Databases |
title_full_unstemmed | The ATLAS Distributed Data Management System & Databases |
title_short | The ATLAS Distributed Data Management System & Databases |
title_sort | atlas distributed data management system & databases |
topic | Detectors and Experimental Techniques |
url | http://cds.cern.ch/record/1551545 |
work_keys_str_mv | AT garonnev theatlasdistributeddatamanagementsystemdatabases AT lassnigm theatlasdistributeddatamanagementsystemdatabases AT barisitsm theatlasdistributeddatamanagementsystemdatabases AT beermannt theatlasdistributeddatamanagementsystemdatabases AT vigner theatlasdistributeddatamanagementsystemdatabases AT serfonc theatlasdistributeddatamanagementsystemdatabases AT garonnev atlasdistributeddatamanagementsystemdatabases AT lassnigm atlasdistributeddatamanagementsystemdatabases AT barisitsm atlasdistributeddatamanagementsystemdatabases AT beermannt atlasdistributeddatamanagementsystemdatabases AT vigner atlasdistributeddatamanagementsystemdatabases AT serfonc atlasdistributeddatamanagementsystemdatabases |