Cargando…
C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA
<!--HTML-->We report the latest development in ROOT/TMVA, a new system that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. We present an overview of the current solutions for conductin...
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2766903 |
_version_ | 1780971246701248512 |
---|---|
author | An, Sitong |
author_facet | An, Sitong |
author_sort | An, Sitong |
collection | CERN |
description | <!--HTML-->We report the latest development in ROOT/TMVA, a new system that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. We present an overview of the current solutions for conducting inference in C++ production environment, discuss the technical details and examples of the generated code, and demonstrates its development status with a preliminary benchmark against popular tools. |
id | cern-2766903 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2021 |
record_format | invenio |
spelling | cern-27669032022-11-02T22:25:53Zhttp://cds.cern.ch/record/2766903engAn, SitongC++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA25th International Conference on Computing in High Energy & Nuclear PhysicsConferences<!--HTML-->We report the latest development in ROOT/TMVA, a new system that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. We present an overview of the current solutions for conducting inference in C++ production environment, discuss the technical details and examples of the generated code, and demonstrates its development status with a preliminary benchmark against popular tools.oai:cds.cern.ch:27669032021 |
spellingShingle | Conferences An, Sitong C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_full | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_fullStr | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_full_unstemmed | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_short | C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA |
title_sort | c++ code generation for fast inference of deep learning models in root/tmva |
topic | Conferences |
url | http://cds.cern.ch/record/2766903 |
work_keys_str_mv | AT ansitong ccodegenerationforfastinferenceofdeeplearningmodelsinroottmva AT ansitong 25thinternationalconferenceoncomputinginhighenergynuclearphysics |