
The ATLAS Distributed Data Management System & Databases


Bibliographic Details
Main authors: Garonne, V, Lassnig, M, Barisits, M, Beermann, T, Vigne, R, Serfon, C
Language: eng
Published: 2013
Subjects: Detectors and Experimental Techniques
Online access: http://cds.cern.ch/record/1551545
_version_ 1780930246802931712
author Garonne, V
Lassnig, M
Barisits, M
Beermann, T
Vigne, R
Serfon, C
author_facet Garonne, V
Lassnig, M
Barisits, M
Beermann, T
Vigne, R
Serfon, C
author_sort Garonne, V
collection CERN
description The ATLAS Distributed Data Management (DDM) System is responsible for the global management of petabytes of high energy physics data. The current system, DQ2, has a critical dependency on Relational Database Management Systems (RDBMS), like Oracle. RDBMS are well-suited to enforcing data integrity in online transaction processing applications; however, concerns have been raised about the scalability of its data warehouse-like workload. In particular, analysis of archived data or aggregation of transactional data for summary purposes is problematic. Therefore, we have evaluated new approaches to handle vast amounts of data. We have investigated a class of database technologies commonly referred to as NoSQL databases. This includes distributed filesystems, like HDFS, that support parallel execution of computational tasks on distributed data, as well as schema-less approaches via key-value stores, like HBase. In this talk, we will describe our use cases in ATLAS, share our experiences with various databases used in production, and present the database technologies envisaged for the next-generation DDM system, Rucio. Rucio is an evolution of the ATLAS DDM system which addresses the scalability issues observed in DQ2.
id cern-1551545
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2013
record_format invenio
spelling cern-1551545 | 2019-09-30T06:29:59Z | http://cds.cern.ch/record/1551545 | eng | Garonne, V; Lassnig, M; Barisits, M; Beermann, T; Vigne, R; Serfon, C | The ATLAS Distributed Data Management System & Databases | Detectors and Experimental Techniques | ATL-SOFT-SLIDE-2013-292 | oai:cds.cern.ch:1551545 | 2013-05-28
spellingShingle Detectors and Experimental Techniques
Garonne, V
Lassnig, M
Barisits, M
Beermann, T
Vigne, R
Serfon, C
The ATLAS Distributed Data Management System & Databases
title The ATLAS Distributed Data Management System & Databases
title_full The ATLAS Distributed Data Management System & Databases
title_fullStr The ATLAS Distributed Data Management System & Databases
title_full_unstemmed The ATLAS Distributed Data Management System & Databases
title_short The ATLAS Distributed Data Management System & Databases
title_sort atlas distributed data management system & databases
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1551545