Cargando…

C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA

We report the latest development in ROOT/TMVA, a new tool that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. An introduction to SOFIE (System for Optimized Fast Inference code Emit) is pres...

Descripción completa

Detalles Bibliográficos
Autores principales:	An, Sitong, Moneta, Lorenzo, Sengupta, Sanjiban, Hamdan, Ahmat, Sossai, Federico, Saxena, Aaradhya
Lenguaje:	eng
Publicado:	2023
Acceso en línea:	https://dx.doi.org/10.1088/1742-6596/2438/1/012013 http://cds.cern.ch/record/2862109

_version_	1780977856954761216
author	An, Sitong Moneta, Lorenzo Sengupta, Sanjiban Hamdan, Ahmat Sossai, Federico Saxena, Aaradhya
author_facet	An, Sitong Moneta, Lorenzo Sengupta, Sanjiban Hamdan, Ahmat Sossai, Federico Saxena, Aaradhya
author_sort	An, Sitong
collection	CERN
description	We report the latest development in ROOT/TMVA, a new tool that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. An introduction to SOFIE (System for Optimized Fast Inference code Emit) is presented, with examples of interface and generated code. We discuss the latest expanded support of a variety of neural network operators, including convolutional and recurrent layers, as well as the integration with RDataFrame. We demonstrate the latest performance of this framework with a set of benchmarks.
id	cern-2862109
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2023
record_format	invenio
spelling	cern-28621092023-06-16T19:24:23Zdoi:10.1088/1742-6596/2438/1/012013http://cds.cern.ch/record/2862109engAn, SitongMoneta, LorenzoSengupta, SanjibanHamdan, AhmatSossai, FedericoSaxena, AaradhyaC++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVAWe report the latest development in ROOT/TMVA, a new tool that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. An introduction to SOFIE (System for Optimized Fast Inference code Emit) is presented, with examples of interface and generated code. We discuss the latest expanded support of a variety of neural network operators, including convolutional and recurrent layers, as well as the integration with RDataFrame. We demonstrate the latest performance of this framework with a set of benchmarks.oai:cds.cern.ch:28621092023
spellingShingle	An, Sitong Moneta, Lorenzo Sengupta, Sanjiban Hamdan, Ahmat Sossai, Federico Saxena, Aaradhya C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
title	C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
title_full	C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
title_fullStr	C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
title_full_unstemmed	C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
title_short	C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
title_sort	c++ code generation for fast inference of deep learning models in root/tmva
url	https://dx.doi.org/10.1088/1742-6596/2438/1/012013 http://cds.cern.ch/record/2862109
work_keys_str_mv	AT ansitong ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT monetalorenzo ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT senguptasanjiban ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT hamdanahmat ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT sossaifederico ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT saxenaaaradhya ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva

C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA

Ejemplares similares