Cargando…

C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA

We report the latest development in ROOT/TMVA, a new tool that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. An introduction to SOFIE (System for Optimized Fast Inference code Emit) is pres...

Descripción completa

Detalles Bibliográficos
Autores principales: An, Sitong, Moneta, Lorenzo, Sengupta, Sanjiban, Hamdan, Ahmat, Sossai, Federico, Saxena, Aaradhya
Lenguaje:eng
Publicado: 2023
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/2438/1/012013
http://cds.cern.ch/record/2862109
Descripción
Sumario:We report the latest development in ROOT/TMVA, a new tool that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. An introduction to SOFIE (System for Optimized Fast Inference code Emit) is presented, with examples of interface and generated code. We discuss the latest expanded support of a variety of neural network operators, including convolutional and recurrent layers, as well as the integration with RDataFrame. We demonstrate the latest performance of this framework with a set of benchmarks.