
An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store

Bibliographic Details
Main authors: Malon, D; Van Gemmeren, P; Hawkings, R; Schaffer, A
Language: eng
Published: 2008
Subjects: Computing and Computers
Online access: https://dx.doi.org/10.1088/1742-6596/119/4/042022
http://cds.cern.ch/record/1176561
author Malon, D
Van Gemmeren, P
Hawkings, R
Schaffer, A
collection CERN
description In the ATLAS event store, files are sometimes 'an inconvenient truth.' From the point of view of the ATLAS distributed data management system, files are too small—datasets are the units of interest. From the point of view of the ATLAS event store architecture, files are simply a physical clustering optimization: the units of interest are event collections—sets of events that satisfy common conditions or selection predicates—and such collections may or may not have been accumulated into files that contain those events and no others. It is nonetheless important to maintain file-level metadata, and to cache metadata in event data files. When such metadata may or may not be present in files, or when values may have been updated after files are written and replicated, a clear and transparent model for metadata retrieval from the file itself or from remote databases is required. In this paper we describe how ATLAS reconciles its file and non-file paradigms, the machinery for associating metadata with files and event collections, and the infrastructure for metadata propagation from input to output for provenance record management and related purposes.
id cern-1176561
institution European Organization for Nuclear Research
language eng
publishDate 2008
record_format invenio
title An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/119/4/042022
http://cds.cern.ch/record/1176561