Cargando…

How Many Bits Does it Take to Quantize Your Neural Network?

Quantization converts neural networks into low-bit fixed-point computations which can be carried out by efficient integer-only hardware, and is standard practice for the deployment of neural networks on real-time embedded devices. However, like their real-numbered counterpart, quantized networks are...

Descripción completa

Detalles Bibliográficos
Autores principales: Giacobbe, Mirco, Henzinger, Thomas A., Lechner, Mathias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7480702/
http://dx.doi.org/10.1007/978-3-030-45237-7_5