Cargando…

Towards a responsive CernVM-FS architecture

The CernVM File System (CernVM-FS) provides a scalable and reliable software distribution service implemented as a POSIX read-only filesystem in user space (FUSE). It was originally developed at CERN to assist High Energy Physics (HEP) collaborations in deploying software on the worldwide distribute...

Descripción completa

Detalles Bibliográficos
Autores principales: Popescu, Radu, Blomer, Jakob, Ganis, Gerardo
Lenguaje:eng
Publicado: 2019
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/201921403036
http://cds.cern.ch/record/2701496
_version_ 1780964598410641408
author Popescu, Radu
Blomer, Jakob
Ganis, Gerardo
author_facet Popescu, Radu
Blomer, Jakob
Ganis, Gerardo
author_sort Popescu, Radu
collection CERN
description The CernVM File System (CernVM-FS) provides a scalable and reliable software distribution service implemented as a POSIX read-only filesystem in user space (FUSE). It was originally developed at CERN to assist High Energy Physics (HEP) collaborations in deploying software on the worldwide distributed computing infrastructure for data processing applications. Files are stored remotely as content-addressed blocks on standard web servers and are retrieved and cached on-demand through outgoing HTTP connections only. Repository metadata is recorded in SQLite catalogs, which represent implicit Merkle tree encodings of the repository state. For writing, CernVM-FS follows a publish-subscribe pattern with a single source of new content that is propagated to a large number of readers. This paper focuses on the work to move the CernVM-FS architecture in the direction of a responsive data distribution system. A new distributed publication backend allows scaling out large publication tasks across multiple machines, reducing the time to publish. For the faster propagation of new published content, the addition of a notification system allows clients to subscribe to messages about changes in the repository and to request new root catalogs as soon as they become available. These developments make CernVM-FS more responsive and are particularly relevant for use cases where a short propagation delay from repository down to individual clients is important, such as using CernVM-FS as an AFS replacement for distributing software stacks. Additionally, they permit the implementation of more complex workflows, with producer-consumer pipelines, as for example in the ALICE analysis trains system.
id oai-inspirehep.net-1760934
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2019
record_format invenio
spelling oai-inspirehep.net-17609342022-08-10T12:20:50Zdoi:10.1051/epjconf/201921403036http://cds.cern.ch/record/2701496engPopescu, RaduBlomer, JakobGanis, GerardoTowards a responsive CernVM-FS architectureComputing and ComputersThe CernVM File System (CernVM-FS) provides a scalable and reliable software distribution service implemented as a POSIX read-only filesystem in user space (FUSE). It was originally developed at CERN to assist High Energy Physics (HEP) collaborations in deploying software on the worldwide distributed computing infrastructure for data processing applications. Files are stored remotely as content-addressed blocks on standard web servers and are retrieved and cached on-demand through outgoing HTTP connections only. Repository metadata is recorded in SQLite catalogs, which represent implicit Merkle tree encodings of the repository state. For writing, CernVM-FS follows a publish-subscribe pattern with a single source of new content that is propagated to a large number of readers. This paper focuses on the work to move the CernVM-FS architecture in the direction of a responsive data distribution system. A new distributed publication backend allows scaling out large publication tasks across multiple machines, reducing the time to publish. For the faster propagation of new published content, the addition of a notification system allows clients to subscribe to messages about changes in the repository and to request new root catalogs as soon as they become available. These developments make CernVM-FS more responsive and are particularly relevant for use cases where a short propagation delay from repository down to individual clients is important, such as using CernVM-FS as an AFS replacement for distributing software stacks. Additionally, they permit the implementation of more complex workflows, with producer-consumer pipelines, as for example in the ALICE analysis trains system.oai:inspirehep.net:17609342019
spellingShingle Computing and Computers
Popescu, Radu
Blomer, Jakob
Ganis, Gerardo
Towards a responsive CernVM-FS architecture
title Towards a responsive CernVM-FS architecture
title_full Towards a responsive CernVM-FS architecture
title_fullStr Towards a responsive CernVM-FS architecture
title_full_unstemmed Towards a responsive CernVM-FS architecture
title_short Towards a responsive CernVM-FS architecture
title_sort towards a responsive cernvm-fs architecture
topic Computing and Computers
url https://dx.doi.org/10.1051/epjconf/201921403036
http://cds.cern.ch/record/2701496
work_keys_str_mv AT popescuradu towardsaresponsivecernvmfsarchitecture
AT blomerjakob towardsaresponsivecernvmfsarchitecture
AT ganisgerardo towardsaresponsivecernvmfsarchitecture