Cargando…
Benchmarking Data Acquisition event building network performance for the ATLAS HL-LHC upgrade
The ATLAS experiment Data Acquisition (DAQ) system will face an extensive upgrade to fully exploit the High-Luminosity LHC (HL-LHC) upgrade, allowing it to record data at unprecedented rates. The detector will be read out at 1 MHz generating over 5 TB/s of data which is sent to approximately 600 ser...
Autores principales: | , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2023
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2873584 |
_version_ | 1780978645794291712 |
---|---|
author | Pozo Astigarraga, Eukeni Bonaventura, Matias Maple, James Pecker Marcosig, Ezequiel Levrini, Giacomo Castro, Rodrigo Daniel |
author_facet | Pozo Astigarraga, Eukeni Bonaventura, Matias Maple, James Pecker Marcosig, Ezequiel Levrini, Giacomo Castro, Rodrigo Daniel |
author_sort | Pozo Astigarraga, Eukeni |
collection | CERN |
description | The ATLAS experiment Data Acquisition (DAQ) system will face an extensive upgrade to fully exploit the High-Luminosity LHC (HL-LHC) upgrade, allowing it to record data at unprecedented rates. The detector will be read out at 1 MHz generating over 5 TB/s of data which is sent to approximately 600 servers. These data are then transported to the processing farm, comprising approximately 3000 servers, for a further rate reduction. This design poses significant challenges for the Ethernet-based network as it will be required to transport 20 times more data than the current system. The increased data rate, data sizes, and the number of servers will exacerbate the already diagnosed TCP incast effect, which makes it impossible to fully exploit the capabilities of the network and limits the performance of the processing farm. In this paper, we present exhaustive systematic experiments to define buffer requirements in network equipment to minimise the impacts of TCP incast on the processing applications. Three switch models were stress-tested using DAQ traffic patterns as synthetic network load in a test environment at approximately 10% scale of the expected HL-LHC DAQ system size. As the HL-LHC system's desired hardware is not currently available and the lab size is considerably smaller, tests aim to project buffer requirements with different parameters. Thus, new analytic and simulation tools are introduced to support and scale lab projections. Different candidate solutions are analysed, comparing software-based and network hardware cost-to-performance ratios to determine the most effective option that can mitigate the impact of TCP incast. The results of these evaluations will contribute to the decision-making process of acquiring network hardware for the HL-LHC data acquisition. |
id | cern-2873584 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2023 |
record_format | invenio |
spelling | cern-28735842023-10-03T21:22:43Zhttp://cds.cern.ch/record/2873584engPozo Astigarraga, EukeniBonaventura, MatiasMaple, JamesPecker Marcosig, EzequielLevrini, GiacomoCastro, Rodrigo DanielBenchmarking Data Acquisition event building network performance for the ATLAS HL-LHC upgradeParticle Physics - ExperimentThe ATLAS experiment Data Acquisition (DAQ) system will face an extensive upgrade to fully exploit the High-Luminosity LHC (HL-LHC) upgrade, allowing it to record data at unprecedented rates. The detector will be read out at 1 MHz generating over 5 TB/s of data which is sent to approximately 600 servers. These data are then transported to the processing farm, comprising approximately 3000 servers, for a further rate reduction. This design poses significant challenges for the Ethernet-based network as it will be required to transport 20 times more data than the current system. The increased data rate, data sizes, and the number of servers will exacerbate the already diagnosed TCP incast effect, which makes it impossible to fully exploit the capabilities of the network and limits the performance of the processing farm. In this paper, we present exhaustive systematic experiments to define buffer requirements in network equipment to minimise the impacts of TCP incast on the processing applications. Three switch models were stress-tested using DAQ traffic patterns as synthetic network load in a test environment at approximately 10% scale of the expected HL-LHC DAQ system size. As the HL-LHC system's desired hardware is not currently available and the lab size is considerably smaller, tests aim to project buffer requirements with different parameters. Thus, new analytic and simulation tools are introduced to support and scale lab projections. Different candidate solutions are analysed, comparing software-based and network hardware cost-to-performance ratios to determine the most effective option that can mitigate the impact of TCP incast. The results of these evaluations will contribute to the decision-making process of acquiring network hardware for the HL-LHC data acquisition.ATL-DAQ-SLIDE-2023-533oai:cds.cern.ch:28735842023-10-03 |
spellingShingle | Particle Physics - Experiment Pozo Astigarraga, Eukeni Bonaventura, Matias Maple, James Pecker Marcosig, Ezequiel Levrini, Giacomo Castro, Rodrigo Daniel Benchmarking Data Acquisition event building network performance for the ATLAS HL-LHC upgrade |
title | Benchmarking Data Acquisition event building network performance for the ATLAS HL-LHC upgrade |
title_full | Benchmarking Data Acquisition event building network performance for the ATLAS HL-LHC upgrade |
title_fullStr | Benchmarking Data Acquisition event building network performance for the ATLAS HL-LHC upgrade |
title_full_unstemmed | Benchmarking Data Acquisition event building network performance for the ATLAS HL-LHC upgrade |
title_short | Benchmarking Data Acquisition event building network performance for the ATLAS HL-LHC upgrade |
title_sort | benchmarking data acquisition event building network performance for the atlas hl-lhc upgrade |
topic | Particle Physics - Experiment |
url | http://cds.cern.ch/record/2873584 |
work_keys_str_mv | AT pozoastigarragaeukeni benchmarkingdataacquisitioneventbuildingnetworkperformancefortheatlashllhcupgrade AT bonaventuramatias benchmarkingdataacquisitioneventbuildingnetworkperformancefortheatlashllhcupgrade AT maplejames benchmarkingdataacquisitioneventbuildingnetworkperformancefortheatlashllhcupgrade AT peckermarcosigezequiel benchmarkingdataacquisitioneventbuildingnetworkperformancefortheatlashllhcupgrade AT levrinigiacomo benchmarkingdataacquisitioneventbuildingnetworkperformancefortheatlashllhcupgrade AT castrorodrigodaniel benchmarkingdataacquisitioneventbuildingnetworkperformancefortheatlashllhcupgrade |