Cargando…

Evaluating CephFS Performance vs. Cost on High-Density Commodity Disk Servers

CephFS is a network filesystem built upon the Reliable Autonomic Distributed Object Store (RADOS). At CERN we have demonstrated its reliability and elasticity while operating several 100-to-1000TB clusters which provide NFS-like storage to infrastructure applications and services. At the same time,...

Descripción completa

Detalles Bibliográficos
Autores principales: Peters, Andreas J, van der Ster, Daniel C
Lenguaje:eng
Publicado: 2021
Materias:
Acceso en línea:https://dx.doi.org/10.1007/s41781-021-00071-1
http://cds.cern.ch/record/2790637
_version_ 1780972258201698304
author Peters, Andreas J
van der Ster, Daniel C
author_facet Peters, Andreas J
van der Ster, Daniel C
author_sort Peters, Andreas J
collection CERN
description CephFS is a network filesystem built upon the Reliable Autonomic Distributed Object Store (RADOS). At CERN we have demonstrated its reliability and elasticity while operating several 100-to-1000TB clusters which provide NFS-like storage to infrastructure applications and services. At the same time, our lab developed EOS to offer high performance 100PB-scale storage for the LHC at extremely low costs while also supporting the complete set of security and functional APIs required by the particle-physics user community. This work seeks to evaluate the performance of CephFS on this cost-optimized hardware when it is combined with EOS to support the missing functionalities. To this end, we have setup a proof-of-concept Ceph Octopus cluster on high-density JBOD servers (840 TB each) with 100Gig-E networking. The system uses EOS to provide an overlayed namespace and protocol gateways for HTTP(S) and XROOTD, and uses CephFS as an erasure-coded object storage backend. The solution also enables operators to aggregate several CephFS instances and adds features, such as third-party-copy, SciTokens, and high-level user and quota management. Using simple benchmarks we measure the cost/performance tradeoffs of different erasure-coding layouts, as well as the network overheads of these coding schemes. We demonstrate some relevant limitations of the CephFS metadata server and offer improved tunings which can be generally applicable. To conclude, we reflect on the advantages and drawbacks related to this architecture, such as RADOS-level free space requirements and double-network penalties, and offer ideas for improvements in the future.
id cern-2790637
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2021
record_format invenio
spelling cern-27906372021-11-16T19:52:05Zdoi:10.1007/s41781-021-00071-1http://cds.cern.ch/record/2790637engPeters, Andreas Jvan der Ster, Daniel CEvaluating CephFS Performance vs. Cost on High-Density Commodity Disk ServersComputing and ComputersCephFS is a network filesystem built upon the Reliable Autonomic Distributed Object Store (RADOS). At CERN we have demonstrated its reliability and elasticity while operating several 100-to-1000TB clusters which provide NFS-like storage to infrastructure applications and services. At the same time, our lab developed EOS to offer high performance 100PB-scale storage for the LHC at extremely low costs while also supporting the complete set of security and functional APIs required by the particle-physics user community. This work seeks to evaluate the performance of CephFS on this cost-optimized hardware when it is combined with EOS to support the missing functionalities. To this end, we have setup a proof-of-concept Ceph Octopus cluster on high-density JBOD servers (840 TB each) with 100Gig-E networking. The system uses EOS to provide an overlayed namespace and protocol gateways for HTTP(S) and XROOTD, and uses CephFS as an erasure-coded object storage backend. The solution also enables operators to aggregate several CephFS instances and adds features, such as third-party-copy, SciTokens, and high-level user and quota management. Using simple benchmarks we measure the cost/performance tradeoffs of different erasure-coding layouts, as well as the network overheads of these coding schemes. We demonstrate some relevant limitations of the CephFS metadata server and offer improved tunings which can be generally applicable. To conclude, we reflect on the advantages and drawbacks related to this architecture, such as RADOS-level free space requirements and double-network penalties, and offer ideas for improvements in the future.oai:cds.cern.ch:27906372021
spellingShingle Computing and Computers
Peters, Andreas J
van der Ster, Daniel C
Evaluating CephFS Performance vs. Cost on High-Density Commodity Disk Servers
title Evaluating CephFS Performance vs. Cost on High-Density Commodity Disk Servers
title_full Evaluating CephFS Performance vs. Cost on High-Density Commodity Disk Servers
title_fullStr Evaluating CephFS Performance vs. Cost on High-Density Commodity Disk Servers
title_full_unstemmed Evaluating CephFS Performance vs. Cost on High-Density Commodity Disk Servers
title_short Evaluating CephFS Performance vs. Cost on High-Density Commodity Disk Servers
title_sort evaluating cephfs performance vs. cost on high-density commodity disk servers
topic Computing and Computers
url https://dx.doi.org/10.1007/s41781-021-00071-1
http://cds.cern.ch/record/2790637
work_keys_str_mv AT petersandreasj evaluatingcephfsperformancevscostonhighdensitycommoditydiskservers
AT vandersterdanielc evaluatingcephfsperformancevscostonhighdensitycommoditydiskservers