Cargando…

FPGA-Accelerated Neural Network Inference for Ultra-Low-Latency Recalibration and Classification of Physics Objects at 40 MHz within CMS

<!--HTML-->In the realm of data processing and physics analysis at the Large Hadron Collider (LHC), there exists a notable advantage of deep learning-based algorithms over traditional physics-based counterparts. This study explores cutting-edge methodologies for the low latency neural network...

Descripción completa

Detalles Bibliográficos
Autor principal: Choudhury, Diptarko
Lenguaje:eng
Publicado: 2023
Materias:
Acceso en línea:http://cds.cern.ch/record/2868471
_version_ 1780978223301001216
author Choudhury, Diptarko
author_facet Choudhury, Diptarko
author_sort Choudhury, Diptarko
collection CERN
description <!--HTML-->In the realm of data processing and physics analysis at the Large Hadron Collider (LHC), there exists a notable advantage of deep learning-based algorithms over traditional physics-based counterparts. This study explores cutting-edge methodologies for the low latency neural network inference on Field Programmable Gate Array (FPGA) devices. Specifically, we concentrate on the recalibration and classification of physics objects at a demanding rate of 40 MHz within the CMS framework. The primary objective of this work is to develop an ultra-low-latency neural network model, strategically combining various techniques such as Quantization Aware Training, Knowledge Distillation, transfer learning, and Pruning schedules to achieve an exceptionally low latency without compromising the integral reconstruction performance.
id cern-2868471
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2023
record_format invenio
spelling cern-28684712023-08-25T20:29:07Zhttp://cds.cern.ch/record/2868471engChoudhury, DiptarkoFPGA-Accelerated Neural Network Inference for Ultra-Low-Latency Recalibration and Classification of Physics Objects at 40 MHz within CMSCERN openlab Summer Student Lightning Talks (2/2)CERN openlab Summer Student Programme 2023<!--HTML-->In the realm of data processing and physics analysis at the Large Hadron Collider (LHC), there exists a notable advantage of deep learning-based algorithms over traditional physics-based counterparts. This study explores cutting-edge methodologies for the low latency neural network inference on Field Programmable Gate Array (FPGA) devices. Specifically, we concentrate on the recalibration and classification of physics objects at a demanding rate of 40 MHz within the CMS framework. The primary objective of this work is to develop an ultra-low-latency neural network model, strategically combining various techniques such as Quantization Aware Training, Knowledge Distillation, transfer learning, and Pruning schedules to achieve an exceptionally low latency without compromising the integral reconstruction performance.oai:cds.cern.ch:28684712023
spellingShingle CERN openlab Summer Student Programme 2023
Choudhury, Diptarko
FPGA-Accelerated Neural Network Inference for Ultra-Low-Latency Recalibration and Classification of Physics Objects at 40 MHz within CMS
title FPGA-Accelerated Neural Network Inference for Ultra-Low-Latency Recalibration and Classification of Physics Objects at 40 MHz within CMS
title_full FPGA-Accelerated Neural Network Inference for Ultra-Low-Latency Recalibration and Classification of Physics Objects at 40 MHz within CMS
title_fullStr FPGA-Accelerated Neural Network Inference for Ultra-Low-Latency Recalibration and Classification of Physics Objects at 40 MHz within CMS
title_full_unstemmed FPGA-Accelerated Neural Network Inference for Ultra-Low-Latency Recalibration and Classification of Physics Objects at 40 MHz within CMS
title_short FPGA-Accelerated Neural Network Inference for Ultra-Low-Latency Recalibration and Classification of Physics Objects at 40 MHz within CMS
title_sort fpga-accelerated neural network inference for ultra-low-latency recalibration and classification of physics objects at 40 mhz within cms
topic CERN openlab Summer Student Programme 2023
url http://cds.cern.ch/record/2868471
work_keys_str_mv AT choudhurydiptarko fpgaacceleratedneuralnetworkinferenceforultralowlatencyrecalibrationandclassificationofphysicsobjectsat40mhzwithincms
AT choudhurydiptarko cernopenlabsummerstudentlightningtalks22