Cargando…
Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system
ALICE (A Large Ion Collider Experiment) is a heavy-ion detector designed to study the physics of strongly interacting matter (the Quark–Gluon Plasma at the CERN LHC (Large Hadron Collider). ALICE has been successfully collecting physics data in Run 2 since spring 2015. In parallel, preparations for...
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2017
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/898/3/032026 http://cds.cern.ch/record/2297451 |
_version_ | 1780956895817760768 |
---|---|
author | Wegrzynek, A |
author_facet | Wegrzynek, A |
author_sort | Wegrzynek, A |
collection | CERN |
description | ALICE (A Large Ion Collider Experiment) is a heavy-ion detector designed to study the physics of strongly interacting matter (the Quark–Gluon Plasma at the CERN LHC (Large Hadron Collider). ALICE has been successfully collecting physics data in Run 2 since spring 2015. In parallel, preparations for a major upgrade of the computing system, called O2 (Online-Offline), scheduled for the Long Shutdown 2 in 2019-2020, are being made. One of the major requirements of the system is the capacity to transport data between so-called FLPs (First Level Processors), equipped with readout cards, and the EPNs (Event Processing Node), performing data aggregation, frame building and partial reconstruction. It is foreseen to have 268 FLPs dispatching data to 1500 EPNs with an average output of 20 Gb/s each. In overall, the O2 processing system will operate at terabits per second of throughput while handling millions of concurrent connections. The ALFA framework will standardize and handle software related tasks such as readout, data transport, frame building, calibration, online reconstruction and more in the upgraded computing system. ALFA supports two data transport libraries: ZeroMQ and nanomsg. This paper discusses the efficiency of ALFA in terms of high throughput data transport. The tests were performed with multiple FLPs pushing data to multiple EPNs. The transfer was done using push-pull communication patterns and two socket configurations: bind, connect. The set of benchmarks was prepared to get the most performant results on each hardware setup. The paper presents the measurement process and final results – data throughput combined with computing resources usage as a function of block size. The high number of nodes and connections in the final set up may cause race conditions that can lead to uneven load balancing and poor scalability. The performed tests allow us to validate whether the traffic is distributed evenly over all receivers. It also measures the behaviour of the network in saturation and evaluates scalability from a 1-to-1 to a N-to-M solution. |
id | oai-inspirehep.net-1638251 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2017 |
record_format | invenio |
spelling | oai-inspirehep.net-16382512021-02-09T10:07:14Zdoi:10.1088/1742-6596/898/3/032026http://cds.cern.ch/record/2297451engWegrzynek, AUsing ALFA for high throughput, distributed data transmission in the ALICE O$^2$ systemComputing and ComputersDetectors and Experimental TechniquesALICE (A Large Ion Collider Experiment) is a heavy-ion detector designed to study the physics of strongly interacting matter (the Quark–Gluon Plasma at the CERN LHC (Large Hadron Collider). ALICE has been successfully collecting physics data in Run 2 since spring 2015. In parallel, preparations for a major upgrade of the computing system, called O2 (Online-Offline), scheduled for the Long Shutdown 2 in 2019-2020, are being made. One of the major requirements of the system is the capacity to transport data between so-called FLPs (First Level Processors), equipped with readout cards, and the EPNs (Event Processing Node), performing data aggregation, frame building and partial reconstruction. It is foreseen to have 268 FLPs dispatching data to 1500 EPNs with an average output of 20 Gb/s each. In overall, the O2 processing system will operate at terabits per second of throughput while handling millions of concurrent connections. The ALFA framework will standardize and handle software related tasks such as readout, data transport, frame building, calibration, online reconstruction and more in the upgraded computing system. ALFA supports two data transport libraries: ZeroMQ and nanomsg. This paper discusses the efficiency of ALFA in terms of high throughput data transport. The tests were performed with multiple FLPs pushing data to multiple EPNs. The transfer was done using push-pull communication patterns and two socket configurations: bind, connect. The set of benchmarks was prepared to get the most performant results on each hardware setup. The paper presents the measurement process and final results – data throughput combined with computing resources usage as a function of block size. The high number of nodes and connections in the final set up may cause race conditions that can lead to uneven load balancing and poor scalability. The performed tests allow us to validate whether the traffic is distributed evenly over all receivers. It also measures the behaviour of the network in saturation and evaluates scalability from a 1-to-1 to a N-to-M solution.oai:inspirehep.net:16382512017 |
spellingShingle | Computing and Computers Detectors and Experimental Techniques Wegrzynek, A Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system |
title | Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system |
title_full | Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system |
title_fullStr | Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system |
title_full_unstemmed | Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system |
title_short | Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system |
title_sort | using alfa for high throughput, distributed data transmission in the alice o$^2$ system |
topic | Computing and Computers Detectors and Experimental Techniques |
url | https://dx.doi.org/10.1088/1742-6596/898/3/032026 http://cds.cern.ch/record/2297451 |
work_keys_str_mv | AT wegrzyneka usingalfaforhighthroughputdistributeddatatransmissioninthealiceo2system |