Cargando…

Extending Rucio with modern cloud storage support

Rucio is a software framework that provides scientific collaborations with the ability to organise, manage and access large volumes of data using customisable policies. The data can be spread across globally distributed locations and across heterogeneous data centres, uniting different storage and n...

Descripción completa

Detalles Bibliográficos
Autores principales: Lassnig, Mario, Barisits, Martin, Elmsheuser, Johannes, Patrascoiu, Mihai, Serfon, Cedric, Vendrell Moya, Alba, Wegner, Tobias
Lenguaje:eng
Publicado: 2023
Materias:
Acceso en línea:http://cds.cern.ch/record/2857740
_version_ 1780977581769621504
author Lassnig, Mario
Barisits, Martin
Elmsheuser, Johannes
Patrascoiu, Mihai
Serfon, Cedric
Vendrell Moya, Alba
Wegner, Tobias
author_facet Lassnig, Mario
Barisits, Martin
Elmsheuser, Johannes
Patrascoiu, Mihai
Serfon, Cedric
Vendrell Moya, Alba
Wegner, Tobias
author_sort Lassnig, Mario
collection CERN
description Rucio is a software framework that provides scientific collaborations with the ability to organise, manage and access large volumes of data using customisable policies. The data can be spread across globally distributed locations and across heterogeneous data centres, uniting different storage and network technologies as a single federated entity. Rucio offers advanced features such as distributed data recovery or adaptive replication, and is highly scalable, modular, and extensible. Rucio has been originally developed to meet the requirements of the high-energy physics experiment ATLAS, and is being continuously extended to support LHC experiments and other diverse scientific communities. In recent years several R&D projects in these communities have started to evaluate the integration of both private and commercially-provided cloud storage systems. As they are using Rucio, new functionality has been developed to make the integration as seamless as possible. In addition the underlying systems, FTS and GFAL/Davix, have been extended for these use cases. In this contribution we detail the technical aspects of this work. In particular the challenges when building a generic interface to self-hosted cloud storage such as MinIO or CEPH S3 Gateway, to established providers such as Google Cloud Storage and Amazon Simple Storage Service, as well as upcoming decentralised clouds such as SEAL. We will highlight aspects such as authentication and authorisation, direct and remote access, throughput and cost estimation, and give experiences on daily operations.
id cern-2857740
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2023
record_format invenio
spelling cern-28577402023-05-04T18:19:49Zhttp://cds.cern.ch/record/2857740engLassnig, MarioBarisits, MartinElmsheuser, JohannesPatrascoiu, MihaiSerfon, CedricVendrell Moya, AlbaWegner, TobiasExtending Rucio with modern cloud storage supportParticle Physics - ExperimentRucio is a software framework that provides scientific collaborations with the ability to organise, manage and access large volumes of data using customisable policies. The data can be spread across globally distributed locations and across heterogeneous data centres, uniting different storage and network technologies as a single federated entity. Rucio offers advanced features such as distributed data recovery or adaptive replication, and is highly scalable, modular, and extensible. Rucio has been originally developed to meet the requirements of the high-energy physics experiment ATLAS, and is being continuously extended to support LHC experiments and other diverse scientific communities. In recent years several R&D projects in these communities have started to evaluate the integration of both private and commercially-provided cloud storage systems. As they are using Rucio, new functionality has been developed to make the integration as seamless as possible. In addition the underlying systems, FTS and GFAL/Davix, have been extended for these use cases. In this contribution we detail the technical aspects of this work. In particular the challenges when building a generic interface to self-hosted cloud storage such as MinIO or CEPH S3 Gateway, to established providers such as Google Cloud Storage and Amazon Simple Storage Service, as well as upcoming decentralised clouds such as SEAL. We will highlight aspects such as authentication and authorisation, direct and remote access, throughput and cost estimation, and give experiences on daily operations.ATL-SOFT-SLIDE-2023-146oai:cds.cern.ch:28577402023-05-04
spellingShingle Particle Physics - Experiment
Lassnig, Mario
Barisits, Martin
Elmsheuser, Johannes
Patrascoiu, Mihai
Serfon, Cedric
Vendrell Moya, Alba
Wegner, Tobias
Extending Rucio with modern cloud storage support
title Extending Rucio with modern cloud storage support
title_full Extending Rucio with modern cloud storage support
title_fullStr Extending Rucio with modern cloud storage support
title_full_unstemmed Extending Rucio with modern cloud storage support
title_short Extending Rucio with modern cloud storage support
title_sort extending rucio with modern cloud storage support
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2857740
work_keys_str_mv AT lassnigmario extendingruciowithmoderncloudstoragesupport
AT barisitsmartin extendingruciowithmoderncloudstoragesupport
AT elmsheuserjohannes extendingruciowithmoderncloudstoragesupport
AT patrascoiumihai extendingruciowithmoderncloudstoragesupport
AT serfoncedric extendingruciowithmoderncloudstoragesupport
AT vendrellmoyaalba extendingruciowithmoderncloudstoragesupport
AT wegnertobias extendingruciowithmoderncloudstoragesupport