Cargando…

NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features

While the GPGPU paradigm is widely recognized as an effective approach to high performance computing, its adoption in low-latency, real-time systems is still in its early stages. Although GPUs typically show deterministic behaviour in terms of latency in executing computational kernels as soon as da...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lonardo, A., Ameli, F., Ammendola, R., Biagioni, A., Frezza, O., Lamanna, G., Lo Cicero, F., Martinelli, M., Nicolau, C., Paolucci, P.S., Pastorelli, E., Pontisso, L., Rossetti, D., Simeone, F., Simula, F., Sozzi, M., Tosoratto, L., Vicini, P.
Lenguaje:	eng
Publicado:	2014
Materias:	physics.ins-det cs.AR Computing and Computers Detectors and Experimental Techniques
Acceso en línea:	http://cds.cern.ch/record/1709601

_version_	1780936646736216064
author	Lonardo, A. Ameli, F. Ammendola, R. Biagioni, A. Frezza, O. Lamanna, G. Lo Cicero, F. Martinelli, M. Nicolau, C. Paolucci, P.S. Pastorelli, E. Pontisso, L. Rossetti, D. Simeone, F. Simula, F. Sozzi, M. Tosoratto, L. Vicini, P.
author_facet	Lonardo, A. Ameli, F. Ammendola, R. Biagioni, A. Frezza, O. Lamanna, G. Lo Cicero, F. Martinelli, M. Nicolau, C. Paolucci, P.S. Pastorelli, E. Pontisso, L. Rossetti, D. Simeone, F. Simula, F. Sozzi, M. Tosoratto, L. Vicini, P.
author_sort	Lonardo, A.
collection	CERN
description	While the GPGPU paradigm is widely recognized as an effective approach to high performance computing, its adoption in low-latency, real-time systems is still in its early stages. Although GPUs typically show deterministic behaviour in terms of latency in executing computational kernels as soon as data is available in their internal memories, assessment of real-time features of a standard GPGPU system needs careful characterization of all subsystems along data stream path. The networking subsystem results in being the most critical one in terms of absolute value and fluctuations of its response latency. Our envisioned solution to this issue is NaNet, a FPGA-based PCIe Network Interface Card (NIC) design featuring a configurable and extensible set of network channels with direct access through GPUDirect to NVIDIA Fermi/Kepler GPU memories. NaNet design currently supports both standard - GbE (1000BASE-T) and 10GbE (10Base-R) - and custom - 34~Gbps APElink and 2.5~Gbps deterministic latency KM3link - channels, but its modularity allows for a straightforward inclusion of other link technologies. To avoid host OS intervention on data stream and remove a possible source of jitter, the design includes a network/transport layer offload module with cycle-accurate, upper-bound latency, supporting UDP, KM3link Time Division Multiplexing and APElink protocols. After NaNet architecture description and its latency/bandwidth characterization for all supported links, two real world use cases will be presented: the GPU-based low level trigger for the RICH detector in the NA62 experiment at CERN and the on-/off-shore data link for KM3 underwater neutrino telescope.
id	cern-1709601
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2014
record_format	invenio
spelling	cern-17096012023-03-14T19:39:22Zhttp://cds.cern.ch/record/1709601engLonardo, A.Ameli, F.Ammendola, R.Biagioni, A.Frezza, O.Lamanna, G.Lo Cicero, F.Martinelli, M.Nicolau, C.Paolucci, P.S.Pastorelli, E.Pontisso, L.Rossetti, D.Simeone, F.Simula, F.Sozzi, M.Tosoratto, L.Vicini, P.NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Featuresphysics.ins-detcs.ARComputing and ComputersDetectors and Experimental TechniquesWhile the GPGPU paradigm is widely recognized as an effective approach to high performance computing, its adoption in low-latency, real-time systems is still in its early stages. Although GPUs typically show deterministic behaviour in terms of latency in executing computational kernels as soon as data is available in their internal memories, assessment of real-time features of a standard GPGPU system needs careful characterization of all subsystems along data stream path. The networking subsystem results in being the most critical one in terms of absolute value and fluctuations of its response latency. Our envisioned solution to this issue is NaNet, a FPGA-based PCIe Network Interface Card (NIC) design featuring a configurable and extensible set of network channels with direct access through GPUDirect to NVIDIA Fermi/Kepler GPU memories. NaNet design currently supports both standard - GbE (1000BASE-T) and 10GbE (10Base-R) - and custom - 34~Gbps APElink and 2.5~Gbps deterministic latency KM3link - channels, but its modularity allows for a straightforward inclusion of other link technologies. To avoid host OS intervention on data stream and remove a possible source of jitter, the design includes a network/transport layer offload module with cycle-accurate, upper-bound latency, supporting UDP, KM3link Time Division Multiplexing and APElink protocols. After NaNet architecture description and its latency/bandwidth characterization for all supported links, two real world use cases will be presented: the GPU-based low level trigger for the RICH detector in the NA62 experiment at CERN and the on-/off-shore data link for KM3 underwater neutrino telescope.While the GPGPU paradigm is widely recognized as an effective approach to high performance computing, its adoption in low-latency, real-time systems is still in its early stages. Although GPUs typically show deterministic behaviour in terms of latency in executing computational kernels as soon as data is available in their internal memories, assessment of real-time features of a standard GPGPU system needs careful characterization of all subsystems along data stream path. The networking subsystem results in being the most critical one in terms of absolute value and fluctuations of its response latency. Our envisioned solution to this issue is NaNet, a FPGA-based PCIe Network Interface Card (NIC) design featuring a configurable and extensible set of network channels with direct access through GPUDirect to NVIDIA Fermi/Kepler GPU memories. NaNet design currently supports both standard - GbE (1000BASE-T) and 10GbE (10Base-R) - and custom - 34~Gbps APElink and 2.5~Gbps deterministic latency KM3link - channels, but its modularity allows for a straightforward inclusion of other link technologies. To avoid host OS intervention on data stream and remove a possible source of jitter, the design includes a network/transport layer offload module with cycle-accurate, upper-bound latency, supporting UDP, KM3link Time Division Multiplexing and APElink protocols. After NaNet architecture description and its latency/bandwidth characterization for all supported links, two real world use cases will be presented: the GPU-based low level trigger for the RICH detector in the NA62 experiment at CERN and the on-/off-shore data link for KM3 underwater neutrino telescope.arXiv:1406.3568oai:cds.cern.ch:17096012014-06-13
spellingShingle	physics.ins-det cs.AR Computing and Computers Detectors and Experimental Techniques Lonardo, A. Ameli, F. Ammendola, R. Biagioni, A. Frezza, O. Lamanna, G. Lo Cicero, F. Martinelli, M. Nicolau, C. Paolucci, P.S. Pastorelli, E. Pontisso, L. Rossetti, D. Simeone, F. Simula, F. Sozzi, M. Tosoratto, L. Vicini, P. NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features
title	NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features
title_full	NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features
title_fullStr	NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features
title_full_unstemmed	NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features
title_short	NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features
title_sort	nanet: a low-latency, real-time, multi-standard network interface card with gpudirect features
topic	physics.ins-det cs.AR Computing and Computers Detectors and Experimental Techniques
url	http://cds.cern.ch/record/1709601
work_keys_str_mv	AT lonardoa nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT amelif nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT ammendolar nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT biagionia nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT frezzao nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT lamannag nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT locicerof nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT martinellim nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT nicolauc nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT paoluccips nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT pastorellie nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT pontissol nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT rossettid nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT simeonef nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT simulaf nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT sozzim nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT tosorattol nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures AT vicinip nanetalowlatencyrealtimemultistandardnetworkinterfacecardwithgpudirectfeatures

NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features

Ejemplares similares