Cargando…
C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
We report the latest development in ROOT/TMVA, a new tool that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. An introduction to SOFIE (System for Optimized Fast Inference code Emit) is pres...
Autores principales: | , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2023
|
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/2438/1/012013 http://cds.cern.ch/record/2862109 |
_version_ | 1780977856954761216 |
---|---|
author | An, Sitong Moneta, Lorenzo Sengupta, Sanjiban Hamdan, Ahmat Sossai, Federico Saxena, Aaradhya |
author_facet | An, Sitong Moneta, Lorenzo Sengupta, Sanjiban Hamdan, Ahmat Sossai, Federico Saxena, Aaradhya |
author_sort | An, Sitong |
collection | CERN |
description | We report the latest development in ROOT/TMVA, a new tool that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. An introduction to SOFIE (System for Optimized Fast Inference code Emit) is presented, with examples of interface and generated code. We discuss the latest expanded support of a variety of neural network operators, including convolutional and recurrent layers, as well as the integration with RDataFrame. We demonstrate the latest performance of this framework with a set of benchmarks. |
id | cern-2862109 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2023 |
record_format | invenio |
spelling | cern-28621092023-06-16T19:24:23Zdoi:10.1088/1742-6596/2438/1/012013http://cds.cern.ch/record/2862109engAn, SitongMoneta, LorenzoSengupta, SanjibanHamdan, AhmatSossai, FedericoSaxena, AaradhyaC++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVAWe report the latest development in ROOT/TMVA, a new tool that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. An introduction to SOFIE (System for Optimized Fast Inference code Emit) is presented, with examples of interface and generated code. We discuss the latest expanded support of a variety of neural network operators, including convolutional and recurrent layers, as well as the integration with RDataFrame. We demonstrate the latest performance of this framework with a set of benchmarks.oai:cds.cern.ch:28621092023 |
spellingShingle | An, Sitong Moneta, Lorenzo Sengupta, Sanjiban Hamdan, Ahmat Sossai, Federico Saxena, Aaradhya C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_full | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_fullStr | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_full_unstemmed | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_short | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_sort | c++ code generation for fast inference of deep learning models in root/tmva |
url | https://dx.doi.org/10.1088/1742-6596/2438/1/012013 http://cds.cern.ch/record/2862109 |
work_keys_str_mv | AT ansitong ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT monetalorenzo ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT senguptasanjiban ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT hamdanahmat ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT sossaifederico ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT saxenaaaradhya ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva |