Cargando…

The ATLAS TAGS database distribution and management: Operational challenges of a multi-terabyte distributed database

The TAG files store summary event quantities that allow a quick selection of interesting events. This data will be produced at a nominal rate of 200 Hz, and is uploaded into a relational database for access from websites and other tools. The estimated database volume is 6TB per year, making it the l...

Descripción completa

Detalles Bibliográficos
Autores principales: Viegas, F, Malon, D, Cranshaw, J, Dimitrov, G, Nowak, M, Nairz, A, Goossens, L, Gallas, E, Gamboa, C, Wong, A, Vinek, E
Lenguaje:eng
Publicado: 2010
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/219/7/072058
http://cds.cern.ch/record/1270560
Descripción
Sumario:The TAG files store summary event quantities that allow a quick selection of interesting events. This data will be produced at a nominal rate of 200 Hz, and is uploaded into a relational database for access from websites and other tools. The estimated database volume is 6TB per year, making it the largest application running on the ATLAS relational databases, at CERN and at other voluntary sites. The sheer volume and high rate of production makes this application a challenge to data and resource management, in many aspects. This paper will focus on the operational challenges of this system. These include: uploading the data from files to the CERN's and remote sites' databases; distributing the TAG metadata that is essential to guide the user through event selection; controlling resource usage of the database, from the user query load to the strategy of cleaning and archiving of old TAG data.