Cargando…

Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System

The ATLAS experiment will undergo a major upgrade to take advantage of the new conditions provided by the upgraded High-Luminosity LHC. The Trigger and Data Acquisition system (TDAQ) will record data at unprecedented rates: the detectors will be read out at 1 MHz generating around 5 TB/s of data. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Abud, Adam Abed, Bonaventura, Matias, Farina, Edoardo, Le Goff, Fabrice
Lenguaje:eng
Publicado: 2021
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/202125104014
http://cds.cern.ch/record/2772665
_version_ 1780971464144453632
author Abud, Adam Abed
Bonaventura, Matias
Farina, Edoardo
Le Goff, Fabrice
author_facet Abud, Adam Abed
Bonaventura, Matias
Farina, Edoardo
Le Goff, Fabrice
author_sort Abud, Adam Abed
collection CERN
description The ATLAS experiment will undergo a major upgrade to take advantage of the new conditions provided by the upgraded High-Luminosity LHC. The Trigger and Data Acquisition system (TDAQ) will record data at unprecedented rates: the detectors will be read out at 1 MHz generating around 5 TB/s of data. The Dataflow system (DF), component of TDAQ, introduces a novel design: readout data are buffered on persistent storage while the event filtering system analyses them to select 10000 events per second for a total recorded throughput of around 60 GB/s. This approach allows for decoupling the detector activity from the event selection process. New challenges then arise for DF: design and implement a distributed, reliable, persistent storage system supporting several TB/s of aggregated throughput while providing tens of PB of capacity.In this paper we first describe some of the challenges that DF is facing: data safety with persistent storage limitations, indexing of data at high-granularity in a highly-distributed system, and high-performance management of storage capacity. Then the ongoing R&D to address each of the them is presented and the performance achieved with a working prototype is shown.
id cern-2772665
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2021
record_format invenio
spelling cern-27726652022-08-22T20:09:01Zdoi:10.1051/epjconf/202125104014http://cds.cern.ch/record/2772665engAbud, Adam AbedBonaventura, MatiasFarina, EdoardoLe Goff, FabriceDesign of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ SystemParticle Physics - ExperimentComputing and ComputersThe ATLAS experiment will undergo a major upgrade to take advantage of the new conditions provided by the upgraded High-Luminosity LHC. The Trigger and Data Acquisition system (TDAQ) will record data at unprecedented rates: the detectors will be read out at 1 MHz generating around 5 TB/s of data. The Dataflow system (DF), component of TDAQ, introduces a novel design: readout data are buffered on persistent storage while the event filtering system analyses them to select 10000 events per second for a total recorded throughput of around 60 GB/s. This approach allows for decoupling the detector activity from the event selection process. New challenges then arise for DF: design and implement a distributed, reliable, persistent storage system supporting several TB/s of aggregated throughput while providing tens of PB of capacity.In this paper we first describe some of the challenges that DF is facing: data safety with persistent storage limitations, indexing of data at high-granularity in a highly-distributed system, and high-performance management of storage capacity. Then the ongoing R&D to address each of the them is presented and the performance achieved with a working prototype is shown.The ATLAS experiment will undergo a major upgrade to take advantage of the new conditions provided by the upgraded High-Luminosity LHC. The Trigger and Data Acquisition system (TDAQ) will record data at unprecedented rates: the detectors will be read out at 1 MHz generating around 5 TB/s of data. The Dataflow system (DF), component of TDAQ, introduces a novel design: readout data are buffered on persistent storage while the event filtering system analyses them to select 10000 events per second for a total recorded throughput of around 60 GB/s. This approach allows for decoupling the detector activity from the event selection process. New challenges then arise for DF: design and implement a distributed, reliable, persistent storage system supporting several TB/s of aggregated throughput while providing tens of PB of capacity. In this paper we first describe some of the challenges that DF is facing: data safety with persistent storage limitations, indexing of data at high-granularity in a highly-distributed system, and high-performance management of storage capacity. Then the ongoing R&D; to address each of the them is presented and the performance achieved with a working prototype is shown.ATL-DAQ-PROC-2021-002oai:cds.cern.ch:27726652021-06-14
spellingShingle Particle Physics - Experiment
Computing and Computers
Abud, Adam Abed
Bonaventura, Matias
Farina, Edoardo
Le Goff, Fabrice
Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System
title Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System
title_full Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System
title_fullStr Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System
title_full_unstemmed Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System
title_short Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System
title_sort design of a resilient, high-throughput, persistent storage system for the atlas phase-ii daq system
topic Particle Physics - Experiment
Computing and Computers
url https://dx.doi.org/10.1051/epjconf/202125104014
http://cds.cern.ch/record/2772665
work_keys_str_mv AT abudadamabed designofaresilienthighthroughputpersistentstoragesystemfortheatlasphaseiidaqsystem
AT bonaventuramatias designofaresilienthighthroughputpersistentstoragesystemfortheatlasphaseiidaqsystem
AT farinaedoardo designofaresilienthighthroughputpersistentstoragesystemfortheatlasphaseiidaqsystem
AT legofffabrice designofaresilienthighthroughputpersistentstoragesystemfortheatlasphaseiidaqsystem