Cargando…
Evaluation of software based redundancy algorithms for the EOS storage system at CERN
EOS is a new disk based storage system used in production at CERN since autumn 2011. It is implemented using the plug-in architecture of the XRootD software framework and allows remote file access via XRootD protocol or POSIX-like file access via FUSE mounting. EOS was designed to fulfill specific r...
Autores principales: | , , |
---|---|
Lenguaje: | eng |
Publicado: |
2012
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/396/4/042046 http://cds.cern.ch/record/1565909 |
_version_ | 1780930946517696512 |
---|---|
author | Peters, Andreas-Joachim Sindrilaru, Elvin Alin Zigann, Philipp |
author_facet | Peters, Andreas-Joachim Sindrilaru, Elvin Alin Zigann, Philipp |
author_sort | Peters, Andreas-Joachim |
collection | CERN |
description | EOS is a new disk based storage system used in production at CERN since autumn 2011. It is implemented using the plug-in architecture of the XRootD software framework and allows remote file access via XRootD protocol or POSIX-like file access via FUSE mounting. EOS was designed to fulfill specific requirements of disk storage scalability and IO scheduling performance for LHC analysis use cases. This is achieved by following a strategy of decoupling disk and tape storage as individual storage systems. A key point of the EOS design is to provide high availability and redundancy of files via a software implementation which uses disk-only storage systems without hardware RAID arrays. All this is aimed at reducing the overall cost of the system and also simplifying the operational procedures. This paper presents the advantages and disadvantages of redundancy by hardware (most classical storage installations) in comparison to redundancy by software. The latter is implemented in the EOS system and achieves its goal by spawning data and parity stripes via remote file access over nodes. The gain in redundancy and reliability comes with a trade-off in the following areas: Increased complexity of the network connectivity CPU intensive parity computations during file creation and recovery Performance loss through remote disk coupling An evaluation and performance figures of several redundancy algorithms are presented for dual parity RAID and Reed-Solomon codecs. Moreover, the characteristics and applicability of these algorithms are discussed in the context of reliable data storage systems. |
id | cern-1565909 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2012 |
record_format | invenio |
spelling | cern-15659092022-08-17T13:25:21Zdoi:10.1088/1742-6596/396/4/042046http://cds.cern.ch/record/1565909engPeters, Andreas-JoachimSindrilaru, Elvin AlinZigann, PhilippEvaluation of software based redundancy algorithms for the EOS storage system at CERNComputing and ComputersEOS is a new disk based storage system used in production at CERN since autumn 2011. It is implemented using the plug-in architecture of the XRootD software framework and allows remote file access via XRootD protocol or POSIX-like file access via FUSE mounting. EOS was designed to fulfill specific requirements of disk storage scalability and IO scheduling performance for LHC analysis use cases. This is achieved by following a strategy of decoupling disk and tape storage as individual storage systems. A key point of the EOS design is to provide high availability and redundancy of files via a software implementation which uses disk-only storage systems without hardware RAID arrays. All this is aimed at reducing the overall cost of the system and also simplifying the operational procedures. This paper presents the advantages and disadvantages of redundancy by hardware (most classical storage installations) in comparison to redundancy by software. The latter is implemented in the EOS system and achieves its goal by spawning data and parity stripes via remote file access over nodes. The gain in redundancy and reliability comes with a trade-off in the following areas: Increased complexity of the network connectivity CPU intensive parity computations during file creation and recovery Performance loss through remote disk coupling An evaluation and performance figures of several redundancy algorithms are presented for dual parity RAID and Reed-Solomon codecs. Moreover, the characteristics and applicability of these algorithms are discussed in the context of reliable data storage systems.oai:cds.cern.ch:15659092012 |
spellingShingle | Computing and Computers Peters, Andreas-Joachim Sindrilaru, Elvin Alin Zigann, Philipp Evaluation of software based redundancy algorithms for the EOS storage system at CERN |
title | Evaluation of software based redundancy algorithms for the EOS storage system at CERN |
title_full | Evaluation of software based redundancy algorithms for the EOS storage system at CERN |
title_fullStr | Evaluation of software based redundancy algorithms for the EOS storage system at CERN |
title_full_unstemmed | Evaluation of software based redundancy algorithms for the EOS storage system at CERN |
title_short | Evaluation of software based redundancy algorithms for the EOS storage system at CERN |
title_sort | evaluation of software based redundancy algorithms for the eos storage system at cern |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/396/4/042046 http://cds.cern.ch/record/1565909 |
work_keys_str_mv | AT petersandreasjoachim evaluationofsoftwarebasedredundancyalgorithmsfortheeosstoragesystematcern AT sindrilaruelvinalin evaluationofsoftwarebasedredundancyalgorithmsfortheeosstoragesystematcern AT zigannphilipp evaluationofsoftwarebasedredundancyalgorithmsfortheeosstoragesystematcern |