Cargando…

GradFreeBits: Gradient-Free Bit Allocation for Mixed-Precision Neural Networks

Quantized neural networks (QNNs) are among the main approaches for deploying deep neural networks on low-resource edge devices. Training QNNs using different levels of precision throughout the network (mixed-precision quantization) typically achieves superior trade-offs between performance and compu...

Descripción completa

Detalles Bibliográficos
Autores principales: Bodner, Benjamin Jacob, Ben-Shalom, Gil, Treister, Eran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9787339/
https://www.ncbi.nlm.nih.gov/pubmed/36560141
http://dx.doi.org/10.3390/s22249772