Cargando…

Scaling the EOS namespace - new developments, and performance optimizations

EOS is the distributed storage solution being developed and deployed at CERN with the primary goal of fulfilling the data needs of the LHC and its various experiments. Being in production since 2011, EOS currently manages around 256 petabytes of raw disk space and 3.4 billion files across several in...

Descripción completa

Detalles Bibliográficos
Autores principales: Bitzes, Georgios, Sindrilaru, Elvin Alin, Peters, Andreas Joachim
Lenguaje:eng
Publicado: 2019
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/201921404019
http://cds.cern.ch/record/2701401
_version_ 1780964600775180288
author Bitzes, Georgios
Sindrilaru, Elvin Alin
Peters, Andreas Joachim
author_facet Bitzes, Georgios
Sindrilaru, Elvin Alin
Peters, Andreas Joachim
author_sort Bitzes, Georgios
collection CERN
description EOS is the distributed storage solution being developed and deployed at CERN with the primary goal of fulfilling the data needs of the LHC and its various experiments. Being in production since 2011, EOS currently manages around 256 petabytes of raw disk space and 3.4 billion files across several instances. Nowadays, EOS is increasingly being used as a distributed filesystem and file sharing platform, which poses scalability challenges on its legacy namespace subsystem, tasked with keeping track of all file and directory metadata on a particular instance. In this paper we discuss said challenges, and present our solution which has recently entered production. We made several architectural improvements to the overall system design, the most important of which was introducing QuarkDB, a highly-available datastore capable of serving as the metadata backend for EOS, tailored to the needs of the namespace. We also describe our efforts in providing comparable latency and performance to the legacy in-memory implementation, both when reading through the use of extensive caching and prefetching, and when writing through the use of latency-hiding techniques involving a persistent, back-pressured local queue for batching writes towards the QuarkDB backend.
id oai-inspirehep.net-1760973
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2019
record_format invenio
spelling oai-inspirehep.net-17609732022-08-10T12:20:31Zdoi:10.1051/epjconf/201921404019http://cds.cern.ch/record/2701401engBitzes, GeorgiosSindrilaru, Elvin AlinPeters, Andreas JoachimScaling the EOS namespace - new developments, and performance optimizationsComputing and ComputersEOS is the distributed storage solution being developed and deployed at CERN with the primary goal of fulfilling the data needs of the LHC and its various experiments. Being in production since 2011, EOS currently manages around 256 petabytes of raw disk space and 3.4 billion files across several instances. Nowadays, EOS is increasingly being used as a distributed filesystem and file sharing platform, which poses scalability challenges on its legacy namespace subsystem, tasked with keeping track of all file and directory metadata on a particular instance. In this paper we discuss said challenges, and present our solution which has recently entered production. We made several architectural improvements to the overall system design, the most important of which was introducing QuarkDB, a highly-available datastore capable of serving as the metadata backend for EOS, tailored to the needs of the namespace. We also describe our efforts in providing comparable latency and performance to the legacy in-memory implementation, both when reading through the use of extensive caching and prefetching, and when writing through the use of latency-hiding techniques involving a persistent, back-pressured local queue for batching writes towards the QuarkDB backend.oai:inspirehep.net:17609732019
spellingShingle Computing and Computers
Bitzes, Georgios
Sindrilaru, Elvin Alin
Peters, Andreas Joachim
Scaling the EOS namespace - new developments, and performance optimizations
title Scaling the EOS namespace - new developments, and performance optimizations
title_full Scaling the EOS namespace - new developments, and performance optimizations
title_fullStr Scaling the EOS namespace - new developments, and performance optimizations
title_full_unstemmed Scaling the EOS namespace - new developments, and performance optimizations
title_short Scaling the EOS namespace - new developments, and performance optimizations
title_sort scaling the eos namespace - new developments, and performance optimizations
topic Computing and Computers
url https://dx.doi.org/10.1051/epjconf/201921404019
http://cds.cern.ch/record/2701401
work_keys_str_mv AT bitzesgeorgios scalingtheeosnamespacenewdevelopmentsandperformanceoptimizations
AT sindrilaruelvinalin scalingtheeosnamespacenewdevelopmentsandperformanceoptimizations
AT petersandreasjoachim scalingtheeosnamespacenewdevelopmentsandperformanceoptimizations