Cargando…

Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system

ALICE (A Large Ion Collider Experiment) is a heavy-ion detector designed to study the physics of strongly interacting matter (the Quark–Gluon Plasma at the CERN LHC (Large Hadron Collider). ALICE has been successfully collecting physics data in Run 2 since spring 2015. In parallel, preparations for...

Descripción completa

Detalles Bibliográficos
Autor principal: Wegrzynek, A
Lenguaje:eng
Publicado: 2017
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/898/3/032026
http://cds.cern.ch/record/2297451
_version_ 1780956895817760768
author Wegrzynek, A
author_facet Wegrzynek, A
author_sort Wegrzynek, A
collection CERN
description ALICE (A Large Ion Collider Experiment) is a heavy-ion detector designed to study the physics of strongly interacting matter (the Quark–Gluon Plasma at the CERN LHC (Large Hadron Collider). ALICE has been successfully collecting physics data in Run 2 since spring 2015. In parallel, preparations for a major upgrade of the computing system, called O2 (Online-Offline), scheduled for the Long Shutdown 2 in 2019-2020, are being made. One of the major requirements of the system is the capacity to transport data between so-called FLPs (First Level Processors), equipped with readout cards, and the EPNs (Event Processing Node), performing data aggregation, frame building and partial reconstruction. It is foreseen to have 268 FLPs dispatching data to 1500 EPNs with an average output of 20 Gb/s each. In overall, the O2 processing system will operate at terabits per second of throughput while handling millions of concurrent connections. The ALFA framework will standardize and handle software related tasks such as readout, data transport, frame building, calibration, online reconstruction and more in the upgraded computing system. ALFA supports two data transport libraries: ZeroMQ and nanomsg. This paper discusses the efficiency of ALFA in terms of high throughput data transport. The tests were performed with multiple FLPs pushing data to multiple EPNs. The transfer was done using push-pull communication patterns and two socket configurations: bind, connect. The set of benchmarks was prepared to get the most performant results on each hardware setup. The paper presents the measurement process and final results – data throughput combined with computing resources usage as a function of block size. The high number of nodes and connections in the final set up may cause race conditions that can lead to uneven load balancing and poor scalability. The performed tests allow us to validate whether the traffic is distributed evenly over all receivers. It also measures the behaviour of the network in saturation and evaluates scalability from a 1-to-1 to a N-to-M solution.
id oai-inspirehep.net-1638251
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling oai-inspirehep.net-16382512021-02-09T10:07:14Zdoi:10.1088/1742-6596/898/3/032026http://cds.cern.ch/record/2297451engWegrzynek, AUsing ALFA for high throughput, distributed data transmission in the ALICE O$^2$ systemComputing and ComputersDetectors and Experimental TechniquesALICE (A Large Ion Collider Experiment) is a heavy-ion detector designed to study the physics of strongly interacting matter (the Quark–Gluon Plasma at the CERN LHC (Large Hadron Collider). ALICE has been successfully collecting physics data in Run 2 since spring 2015. In parallel, preparations for a major upgrade of the computing system, called O2 (Online-Offline), scheduled for the Long Shutdown 2 in 2019-2020, are being made. One of the major requirements of the system is the capacity to transport data between so-called FLPs (First Level Processors), equipped with readout cards, and the EPNs (Event Processing Node), performing data aggregation, frame building and partial reconstruction. It is foreseen to have 268 FLPs dispatching data to 1500 EPNs with an average output of 20 Gb/s each. In overall, the O2 processing system will operate at terabits per second of throughput while handling millions of concurrent connections. The ALFA framework will standardize and handle software related tasks such as readout, data transport, frame building, calibration, online reconstruction and more in the upgraded computing system. ALFA supports two data transport libraries: ZeroMQ and nanomsg. This paper discusses the efficiency of ALFA in terms of high throughput data transport. The tests were performed with multiple FLPs pushing data to multiple EPNs. The transfer was done using push-pull communication patterns and two socket configurations: bind, connect. The set of benchmarks was prepared to get the most performant results on each hardware setup. The paper presents the measurement process and final results – data throughput combined with computing resources usage as a function of block size. The high number of nodes and connections in the final set up may cause race conditions that can lead to uneven load balancing and poor scalability. The performed tests allow us to validate whether the traffic is distributed evenly over all receivers. It also measures the behaviour of the network in saturation and evaluates scalability from a 1-to-1 to a N-to-M solution.oai:inspirehep.net:16382512017
spellingShingle Computing and Computers
Detectors and Experimental Techniques
Wegrzynek, A
Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system
title Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system
title_full Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system
title_fullStr Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system
title_full_unstemmed Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system
title_short Using ALFA for high throughput, distributed data transmission in the ALICE O$^2$ system
title_sort using alfa for high throughput, distributed data transmission in the alice o$^2$ system
topic Computing and Computers
Detectors and Experimental Techniques
url https://dx.doi.org/10.1088/1742-6596/898/3/032026
http://cds.cern.ch/record/2297451
work_keys_str_mv AT wegrzyneka usingalfaforhighthroughputdistributeddatatransmissioninthealiceo2system