Cargando…

Fast convolutional neural networks on FPGAs with hls4ml

We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with convolutional layers on field-programmable gate arrays (FPGAs). By extending the hls4ml library, we demonstrate an inference latency of 5 µs using convolutional architectures, targeting microsecond la...

Descripción completa

Detalles Bibliográficos
Autores principales:	Aarrestad, Thea, Loncar, Vladimir, Ghielmetti, Nicolò, Pierini, Maurizio, Summers, Sioni, Ngadiuba, Jennifer, Petersson, Christoffer, Linander, Hampus, Iiyama, Yutaro, Di Guglielmo, Giuseppe, Duarte, Javier, Harris, Philip, Rankin, Dylan, Jindariani, Sergo, Pedro, Kevin, Tran, Nhan, Liu, Mia, Kreinar, Edward, Wu, Zhenbin, Hoang, Duc
Lenguaje:	eng
Publicado:	2021
Materias:	stat.ML Mathematical Physics and Mathematics physics.ins-det Detectors and Experimental Techniques hep-ex Particle Physics - Experiment cs.CV Computing and Computers cs.LG
Acceso en línea:	https://dx.doi.org/10.1088/2632-2153/ac0ea1 http://cds.cern.ch/record/2751704

Ejemplares similares

Product Jacobi-Theta Boltzmann machines with score matching
por: Pasquale, Andrea, et al.
Publicado: (2023)

End-to-end Sinkhorn Autoencoder with Noise Generator
por: Deja, Kamil, et al.
Publicado: (2020)

Compressing deep neural networks on FPGAs to binary and ternary precision with HLS4ML
por: Loncar, Vladimir, et al.
Publicado: (2021)

QONNX: Representing Arbitrary-Precision Quantized Neural Networks
por: Pappalardo, Alessandro, et al.
Publicado: (2022)

Real-time semantic segmentation on FPGAs for autonomous vehicles with hls4ml
por: Ghielmetti, Nicolò, et al.
Publicado: (2022)

Towards Optimal Compression: Joint Pruning and Quantization
por: Zandonati, Ben, et al.
Publicado: (2023)

Technical Report of Participation in Higgs Boson Machine Learning Challenge
por: Ahmad, S. Raza
Publicado: (2015)

Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml
por: Khoda, Elham E., et al.
Publicado: (2022)

Image-based model parameter optimisation using Model-Assisted Generative Adversarial Networks
por: Alonso-Monsalve, Saúl, et al.
Publicado: (2018)

Sampling the Riemann-Theta Boltzmann Machine
por: Carrazza, Stefano, et al.
Publicado: (2018)

Open-source FPGA-ML codesign for the MLPerf Tiny Benchmark
por: Borras, Hendrik, et al.
Publicado: (2022)

hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices
por: Fahim, Farah, et al.
Publicado: (2021)

Ethnicity Sensitive Author Disambiguation Using Semi-supervised Learning
por: Louppe, Gilles, et al.
Publicado: (2015)

Fast inference of deep neural networks in FPGAs for particle physics
por: Duarte, Javier, et al.
Publicado: (2018)

hls4ml: deploying deep learning on FPGAs for L1 trigger and Data Acquisition
por: Duarte, Javier, et al.
Publicado: (2019)

Automated visual inspection of CMS HGCAL silicon sensor surface using an ensemble of a deep convolutional autoencoder and classifier
por: Grönroos, Sonja, et al.
Publicado: (2023)

Arhuaco: Deep Learning and Isolation Based Security for Distributed High-Throughput Computing
por: Gomez Ramirez, A., et al.
Publicado: (2018)

Lightweight Jet Reconstruction and Identification as an Object Detection Task
por: Pol, Adrian Alan, et al.
Publicado: (2022)

Accelerating Recurrent Neural Networks for Gravitational Wave Experiments
por: Que, Zhiqiang, et al.
Publicado: (2021)

Fast inference of Boosted Decision Trees in FPGAs for particle physics
por: Summers, Sioni, et al.
Publicado: (2020)

Distance-Weighted Graph Neural Networks on FPGAs for Real-Time Particle Reconstruction in High Energy Physics
por: Iiyama, Yutaro, et al.
Publicado: (2020)

End-to-End Physics Event Classification with the CMS Open Data: Applying Image-based Deep Learning on Detector Data to Directly Classify Collision Events at the LHC
por: Andrews, M., et al.
Publicado: (2018)

End-to-End Jet Classification of Quarks and Gluons with the CMS Open Data
por: Andrews, M., et al.
Publicado: (2019)

End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data
por: Andrews, Michael, et al.
Publicado: (2021)

Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs
por: Heintz, Aneesh, et al.
Publicado: (2020)

Leveraging universality of jet taggers through transfer learning
por: Dreyer, Frédéric A., et al.
Publicado: (2022)

Jet tagging in the Lund plane with graph networks
por: Dreyer, Frédéric A., et al.
Publicado: (2020)

Machine Learning methods for simulating particle response in the Zero Degree Calorimeter at the ALICE experiment, CERN
por: Dubiński, Jan, et al.
Publicado: (2023)

Calorimetry with Deep Learning: Particle Simulation and Reconstruction for Collider Physics
por: Belayneh, Dawit, et al.
Publicado: (2019)

End-to-end multi-particle reconstruction in high occupancy imaging calorimeters with graph neural networks
por: Qasim, Shah Rukh, et al.
Publicado: (2022)

Machine Learning Pipelines with Modern Big Data Tools for High Energy Physics
por: Migliorini, Matteo, et al.
Publicado: (2019)

Symbolic Regression on FPGAs for Fast Machine Learning Inference
por: Tsoi, Ho Fung, et al.
Publicado: (2023)

Variational Dropout Sparsification for Particle Identification speed-up
por: Ryzhikov, Artem, et al.
Publicado: (2020)

Hyperparameter optimization, quantum-assisted model performance prediction, and benchmarking of AI-based High Energy Physics workloads using HPC
por: Wulff, Eric, et al.
Publicado: (2023)

Generative Models for Fast Calorimeter Simulation: the LHCb case
por: Chekalina, Viktoria, et al.
Publicado: (2019)

Micro-CernVM: Slashing the Cost of Building and Deploying Virtual Machines
por: Blomer, J., et al.
Publicado: (2014)

Visualization of Publication Impact
por: Maguire, Eamonn, et al.
Publicado: (2016)

Mixed Quantum–Classical Method for Fraud Detection With Quantum Feature Selection
por: Grossi, Michele, et al.
Publicado: (2022)

Running the Dual-PQC GAN on noisy simulators and real quantum hardware
por: Chang, Su Yeon, et al.
Publicado: (2023)

Scalable Global Grid catalogue for LHC Run3 and beyond
por: Martinez Pedreira, Miguel, et al.
Publicado: (2017)

Cannot write session to /tmp/vufind_sessions/sess_nroefj4re7vctr06s2pgtg4uai