
CERN Tape Archive: a distributed, reliable and scalable scheduling system

The CERN Tape Archive (CTA) provides a tape backend to disk systems and, in conjunction with EOS, is managing the data of the LHC experiments at CERN. Magnetic tape storage offers the lowest cost per unit volume today, followed by hard disks and flash. In addition, current tape drives deliver a solid...


Bibliographic Details
Main authors: Cano, Eric, Bahyl, Vladimír, Caffy, Cédric, Cancio, Germán, Davis, Michael, Keeble, Oliver, Kotlyar, Viktor, Leduc, Julien, Murray, Steven
Language: eng
Published: 2021
Subjects: Computing and Computers
Online access: https://dx.doi.org/10.1051/epjconf/202125102037
http://cds.cern.ch/record/2799972
author Cano, Eric
Bahyl, Vladimír
Caffy, Cédric
Cancio, Germán
Davis, Michael
Keeble, Oliver
Kotlyar, Viktor
Leduc, Julien
Murray, Steven
author_sort Cano, Eric
collection CERN
description The CERN Tape Archive (CTA) provides a tape backend to disk systems and, in conjunction with EOS, is managing the data of the LHC experiments at CERN. Magnetic tape storage offers the lowest cost per unit volume today, followed by hard disks and flash. In addition, current tape drives deliver a solid bandwidth (typically 360 MB/s per device), but at the cost of high latencies, both for mounting a tape in the drive and for positioning when accessing non-adjacent files. As a consequence, the transfer scheduler should queue transfer requests before the volume warranting a tape mount is reached. In spite of these transfer latencies, user-interactive operations should have a low latency. The scheduling system for CTA was built from the experience gained with CASTOR. Its implementation ensures reliability and predictable performance, while simplifying development and deployment. As CTA is expected to be used for a long time, lock-in to vendors or technologies was minimized. Finally, quality assurance systems were put in place to validate reliability and performance while allowing fast and safe development turnaround.
id cern-2799972
institution European Organization for Nuclear Research
language eng
publishDate 2021
record_format invenio
title CERN Tape Archive: a distributed, reliable and scalable scheduling system
topic Computing and Computers
url https://dx.doi.org/10.1051/epjconf/202125102037
http://cds.cern.ch/record/2799972