Cargando…
Ethernet for High-Throughput Computing at CERN
When high throughput and utilization of fabric at close-to-the-link capacity are most needed in a cluster, Ethernet is a potential candidate, rivaling traditional HPC interconnects. The distributed real-time data acquisition at particle physics experiments presents an interesting use case. This arti...
Autores principales: | , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2022
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1109/tpds.2022.3163472 http://cds.cern.ch/record/2852832 |
_version_ | 1780977170951176192 |
---|---|
author | Krawczyk, Rafal Colombo, Tommaso Neufeld, Niko Pisani, Flavio Valat, Sebastien |
author_facet | Krawczyk, Rafal Colombo, Tommaso Neufeld, Niko Pisani, Flavio Valat, Sebastien |
author_sort | Krawczyk, Rafal |
collection | CERN |
description | When high throughput and utilization of fabric at close-to-the-link capacity are most needed in a cluster, Ethernet is a potential candidate, rivaling traditional HPC interconnects. The distributed real-time data acquisition at particle physics experiments presents an interesting use case. This article evaluates possible Ethernet-based solutions for aggregating data from hundreds of data sources at a throughput of dozens of Tb/s. This leads us to many-to-one data exchanges where we strive for a cost-optimized setup sustaining more than 80 % of the theoretical link-load. We investigate possible Ethernet-based traffic patterns to handle data acquisition on large multi-source apparatuses. Different numbers of producers and receivers and different link speeds are allowed in a large-scale network. Performance tests were conducted using customized benchmarks and evaluation test benches. The article presents tested scenarios and problems encountered in practice. We describe how our findings influenced the design of a large production system at CERN. We also present relevant general conclusions for a broader range of applications of Ethernet in HPC. |
id | cern-2852832 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2022 |
record_format | invenio |
spelling | cern-28528322023-03-21T15:32:00Zdoi:10.1109/tpds.2022.3163472http://cds.cern.ch/record/2852832engKrawczyk, RafalColombo, TommasoNeufeld, NikoPisani, FlavioValat, SebastienEthernet for High-Throughput Computing at CERNComputing and ComputersWhen high throughput and utilization of fabric at close-to-the-link capacity are most needed in a cluster, Ethernet is a potential candidate, rivaling traditional HPC interconnects. The distributed real-time data acquisition at particle physics experiments presents an interesting use case. This article evaluates possible Ethernet-based solutions for aggregating data from hundreds of data sources at a throughput of dozens of Tb/s. This leads us to many-to-one data exchanges where we strive for a cost-optimized setup sustaining more than 80 % of the theoretical link-load. We investigate possible Ethernet-based traffic patterns to handle data acquisition on large multi-source apparatuses. Different numbers of producers and receivers and different link speeds are allowed in a large-scale network. Performance tests were conducted using customized benchmarks and evaluation test benches. The article presents tested scenarios and problems encountered in practice. We describe how our findings influenced the design of a large production system at CERN. We also present relevant general conclusions for a broader range of applications of Ethernet in HPC.oai:cds.cern.ch:28528322022 |
spellingShingle | Computing and Computers Krawczyk, Rafal Colombo, Tommaso Neufeld, Niko Pisani, Flavio Valat, Sebastien Ethernet for High-Throughput Computing at CERN |
title | Ethernet for High-Throughput Computing at CERN |
title_full | Ethernet for High-Throughput Computing at CERN |
title_fullStr | Ethernet for High-Throughput Computing at CERN |
title_full_unstemmed | Ethernet for High-Throughput Computing at CERN |
title_short | Ethernet for High-Throughput Computing at CERN |
title_sort | ethernet for high-throughput computing at cern |
topic | Computing and Computers |
url | https://dx.doi.org/10.1109/tpds.2022.3163472 http://cds.cern.ch/record/2852832 |
work_keys_str_mv | AT krawczykrafal ethernetforhighthroughputcomputingatcern AT colombotommaso ethernetforhighthroughputcomputingatcern AT neufeldniko ethernetforhighthroughputcomputingatcern AT pisaniflavio ethernetforhighthroughputcomputingatcern AT valatsebastien ethernetforhighthroughputcomputingatcern |