Cargando…
Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System
The ATLAS experiment will undergo a major upgrade to take advantage of the new conditions provided by the upgraded High-Luminosity LHC. The Trigger and Data Acquisition system (TDAQ) will record data at unprecedented rates: the detectors will be read out at 1 MHz generating around 5 TB/s of data. Th...
Autores principales: | , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1051/epjconf/202125104014 http://cds.cern.ch/record/2772665 |
_version_ | 1780971464144453632 |
---|---|
author | Abud, Adam Abed Bonaventura, Matias Farina, Edoardo Le Goff, Fabrice |
author_facet | Abud, Adam Abed Bonaventura, Matias Farina, Edoardo Le Goff, Fabrice |
author_sort | Abud, Adam Abed |
collection | CERN |
description | The ATLAS experiment will undergo a major upgrade to take advantage of the new conditions provided by the upgraded High-Luminosity LHC. The Trigger and Data Acquisition system (TDAQ) will record data at unprecedented rates: the detectors will be read out at 1 MHz generating around 5 TB/s of data. The Dataflow system (DF), component of TDAQ, introduces a novel design: readout data are buffered on persistent storage while the event filtering system analyses them to select 10000 events per second for a total recorded throughput of around 60 GB/s. This approach allows for decoupling the detector activity from the event selection process. New challenges then arise for DF: design and implement a distributed, reliable, persistent storage system supporting several TB/s of aggregated throughput while providing tens of PB of capacity.In this paper we first describe some of the challenges that DF is facing: data safety with persistent storage limitations, indexing of data at high-granularity in a highly-distributed system, and high-performance management of storage capacity. Then the ongoing R&D to address each of the them is presented and the performance achieved with a working prototype is shown. |
id | cern-2772665 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2021 |
record_format | invenio |
spelling | cern-27726652022-08-22T20:09:01Zdoi:10.1051/epjconf/202125104014http://cds.cern.ch/record/2772665engAbud, Adam AbedBonaventura, MatiasFarina, EdoardoLe Goff, FabriceDesign of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ SystemParticle Physics - ExperimentComputing and ComputersThe ATLAS experiment will undergo a major upgrade to take advantage of the new conditions provided by the upgraded High-Luminosity LHC. The Trigger and Data Acquisition system (TDAQ) will record data at unprecedented rates: the detectors will be read out at 1 MHz generating around 5 TB/s of data. The Dataflow system (DF), component of TDAQ, introduces a novel design: readout data are buffered on persistent storage while the event filtering system analyses them to select 10000 events per second for a total recorded throughput of around 60 GB/s. This approach allows for decoupling the detector activity from the event selection process. New challenges then arise for DF: design and implement a distributed, reliable, persistent storage system supporting several TB/s of aggregated throughput while providing tens of PB of capacity.In this paper we first describe some of the challenges that DF is facing: data safety with persistent storage limitations, indexing of data at high-granularity in a highly-distributed system, and high-performance management of storage capacity. Then the ongoing R&D to address each of the them is presented and the performance achieved with a working prototype is shown.The ATLAS experiment will undergo a major upgrade to take advantage of the new conditions provided by the upgraded High-Luminosity LHC. The Trigger and Data Acquisition system (TDAQ) will record data at unprecedented rates: the detectors will be read out at 1 MHz generating around 5 TB/s of data. The Dataflow system (DF), component of TDAQ, introduces a novel design: readout data are buffered on persistent storage while the event filtering system analyses them to select 10000 events per second for a total recorded throughput of around 60 GB/s. This approach allows for decoupling the detector activity from the event selection process. New challenges then arise for DF: design and implement a distributed, reliable, persistent storage system supporting several TB/s of aggregated throughput while providing tens of PB of capacity. In this paper we first describe some of the challenges that DF is facing: data safety with persistent storage limitations, indexing of data at high-granularity in a highly-distributed system, and high-performance management of storage capacity. Then the ongoing R&D; to address each of the them is presented and the performance achieved with a working prototype is shown.ATL-DAQ-PROC-2021-002oai:cds.cern.ch:27726652021-06-14 |
spellingShingle | Particle Physics - Experiment Computing and Computers Abud, Adam Abed Bonaventura, Matias Farina, Edoardo Le Goff, Fabrice Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System |
title | Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System |
title_full | Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System |
title_fullStr | Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System |
title_full_unstemmed | Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System |
title_short | Design of a Resilient, High-Throughput, Persistent Storage System for the ATLAS Phase-II DAQ System |
title_sort | design of a resilient, high-throughput, persistent storage system for the atlas phase-ii daq system |
topic | Particle Physics - Experiment Computing and Computers |
url | https://dx.doi.org/10.1051/epjconf/202125104014 http://cds.cern.ch/record/2772665 |
work_keys_str_mv | AT abudadamabed designofaresilienthighthroughputpersistentstoragesystemfortheatlasphaseiidaqsystem AT bonaventuramatias designofaresilienthighthroughputpersistentstoragesystemfortheatlasphaseiidaqsystem AT farinaedoardo designofaresilienthighthroughputpersistentstoragesystemfortheatlasphaseiidaqsystem AT legofffabrice designofaresilienthighthroughputpersistentstoragesystemfortheatlasphaseiidaqsystem |