Cargando…
C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
We report the latest development in ROOT/TMVA, a new system that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. We present an overview of the current solutions for conducting inference in C+...
Autores principales: | , |
---|---|
Lenguaje: | eng |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1051/epjconf/202125103040 http://cds.cern.ch/record/2780369 |
_version_ | 1780971866698022912 |
---|---|
author | An, Sitong Moneta, Lorenzo |
author_facet | An, Sitong Moneta, Lorenzo |
author_sort | An, Sitong |
collection | CERN |
description | We report the latest development in ROOT/TMVA, a new system that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. We present an overview of the current solutions for conducting inference in C++ production environment, discuss the technical details and examples of the generated code, and demonstrates its development status with a preliminary benchmark against popular tools. |
id | cern-2780369 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2021 |
record_format | invenio |
spelling | cern-27803692021-09-07T19:17:05Zdoi:10.1051/epjconf/202125103040http://cds.cern.ch/record/2780369engAn, SitongMoneta, LorenzoC++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVAComputing and ComputersWe report the latest development in ROOT/TMVA, a new system that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. We present an overview of the current solutions for conducting inference in C++ production environment, discuss the technical details and examples of the generated code, and demonstrates its development status with a preliminary benchmark against popular tools.oai:cds.cern.ch:27803692021 |
spellingShingle | Computing and Computers An, Sitong Moneta, Lorenzo C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_full | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_fullStr | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_full_unstemmed | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_short | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_sort | c++ code generation for fast inference of deep learning models in root/tmva |
topic | Computing and Computers |
url | https://dx.doi.org/10.1051/epjconf/202125103040 http://cds.cern.ch/record/2780369 |
work_keys_str_mv | AT ansitong ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT monetalorenzo ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva |