An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store
In the ATLAS event store, files are sometimes 'an inconvenient truth.' From the point of view of the ATLAS distributed data management system, files are too small—datasets are the units of interest. From the point of view of the ATLAS event store architecture, files are simply a physical c...
Main authors: | Malon, D; Van Gemmeren, P; Hawkings, R; Schaffer, A |
---|---|
Language: | eng |
Published: | 2008 |
Subjects: | Computing and Computers |
Online access: | https://dx.doi.org/10.1088/1742-6596/119/4/042022 http://cds.cern.ch/record/1176561 |
_version_ | 1780916253216473088 |
---|---|
author | Malon, D; Van Gemmeren, P; Hawkings, R; Schaffer, A |
author_facet | Malon, D; Van Gemmeren, P; Hawkings, R; Schaffer, A |
author_sort | Malon, D |
collection | CERN |
description | In the ATLAS event store, files are sometimes 'an inconvenient truth.' From the point of view of the ATLAS distributed data management system, files are too small—datasets are the units of interest. From the point of view of the ATLAS event store architecture, files are simply a physical clustering optimization: the units of interest are event collections—sets of events that satisfy common conditions or selection predicates—and such collections may or may not have been accumulated into files that contain those events and no others. It is nonetheless important to maintain file-level metadata, and to cache metadata in event data files. When such metadata may or may not be present in files, or when values may have been updated after files are written and replicated, a clear and transparent model for metadata retrieval from the file itself or from remote databases is required. In this paper we describe how ATLAS reconciles its file and non-file paradigms, the machinery for associating metadata with files and event collections, and the infrastructure for metadata propagation from input to output for provenance record management and related purposes. |
id | cern-1176561 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2008 |
record_format | invenio |
spelling | cern-1176561; 2022-08-17T13:35:42Z; doi:10.1088/1742-6596/119/4/042022; http://cds.cern.ch/record/1176561; eng; Malon, D; Van Gemmeren, P; Hawkings, R; Schaffer, A; An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store; Computing and Computers; oai:cds.cern.ch:1176561; 2008 |
spellingShingle | Computing and Computers; Malon, D; Van Gemmeren, P; Hawkings, R; Schaffer, A; An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store |
title | An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store |
title_full | An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store |
title_fullStr | An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store |
title_full_unstemmed | An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store |
title_short | An inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) ATLAS event store |
title_sort | inconvenient truth: file-level metadata and in-file metadata caching in the (file-agnostic) atlas event store |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/119/4/042022 http://cds.cern.ch/record/1176561 |
work_keys_str_mv | AT malond aninconvenienttruthfilelevelmetadataandinfilemetadatacachinginthefileagnosticatlaseventstore AT vangemmerenp aninconvenienttruthfilelevelmetadataandinfilemetadatacachinginthefileagnosticatlaseventstore AT hawkingsr aninconvenienttruthfilelevelmetadataandinfilemetadatacachinginthefileagnosticatlaseventstore AT schaffera aninconvenienttruthfilelevelmetadataandinfilemetadatacachinginthefileagnosticatlaseventstore AT malond inconvenienttruthfilelevelmetadataandinfilemetadatacachinginthefileagnosticatlaseventstore AT vangemmerenp inconvenienttruthfilelevelmetadataandinfilemetadatacachinginthefileagnosticatlaseventstore AT hawkingsr inconvenienttruthfilelevelmetadataandinfilemetadatacachinginthefileagnosticatlaseventstore AT schaffera inconvenienttruthfilelevelmetadataandinfilemetadatacachinginthefileagnosticatlaseventstore |