Cargando…

On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays

Many types of Convolutional Neural Network (CNN) models and training methods have been proposed in recent years aiming to provide efficiency for embedded and edge devices with limited computation and memory resources. The wide variety of architectures makes this a complex task that has to balance ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yanan, Bose, Laurie, Fan, Rui, Dudek, Piotr, Mayol-Cuevas, Walterio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9421154/
https://www.ncbi.nlm.nih.gov/pubmed/36046469
http://dx.doi.org/10.3389/fnins.2022.909448
_version_ 1784777534627381248
author Liu, Yanan
Bose, Laurie
Fan, Rui
Dudek, Piotr
Mayol-Cuevas, Walterio
author_facet Liu, Yanan
Bose, Laurie
Fan, Rui
Dudek, Piotr
Mayol-Cuevas, Walterio
author_sort Liu, Yanan
collection PubMed
description Many types of Convolutional Neural Network (CNN) models and training methods have been proposed in recent years aiming to provide efficiency for embedded and edge devices with limited computation and memory resources. The wide variety of architectures makes this a complex task that has to balance generality with efficiency. Among the most interesting camera-sensor architectures are Pixel Processor Arrays (PPAs). This study presents two methods that are useful for embedded CNNs in general but particularly suitable for PPAs. The first is for training purely binarized CNNs, the second is for deploying larger models with a model swapping paradigm that loads model components dynamically. Specifically, this study trains and implements networks with batch normalization and adaptive threshold for binary activations. Then, we convert batch normalization and binary activations into a bias matrix which can be parallelly implemented by an add/sub operation. For dynamic model swapping, we propose to decompose applications that are beyond the capacity of a PPA into sub-tasks that can be solved by tree networks that can be loaded dynamically as needed. We demonstrate our approaches to various tasks including classification, localization, and coarse segmentation on a highly resource constrained PPA sensor-processor.
format Online
Article
Text
id pubmed-9421154
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-94211542022-08-30 On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays Liu, Yanan Bose, Laurie Fan, Rui Dudek, Piotr Mayol-Cuevas, Walterio Front Neurosci Neuroscience Many types of Convolutional Neural Network (CNN) models and training methods have been proposed in recent years aiming to provide efficiency for embedded and edge devices with limited computation and memory resources. The wide variety of architectures makes this a complex task that has to balance generality with efficiency. Among the most interesting camera-sensor architectures are Pixel Processor Arrays (PPAs). This study presents two methods that are useful for embedded CNNs in general but particularly suitable for PPAs. The first is for training purely binarized CNNs, the second is for deploying larger models with a model swapping paradigm that loads model components dynamically. Specifically, this study trains and implements networks with batch normalization and adaptive threshold for binary activations. Then, we convert batch normalization and binary activations into a bias matrix which can be parallelly implemented by an add/sub operation. For dynamic model swapping, we propose to decompose applications that are beyond the capacity of a PPA into sub-tasks that can be solved by tree networks that can be loaded dynamically as needed. We demonstrate our approaches to various tasks including classification, localization, and coarse segmentation on a highly resource constrained PPA sensor-processor. Frontiers Media S.A. 2022-08-15 /pmc/articles/PMC9421154/ /pubmed/36046469 http://dx.doi.org/10.3389/fnins.2022.909448 Text en Copyright © 2022 Liu, Bose, Fan, Dudek and Mayol-Cuevas. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Liu, Yanan
Bose, Laurie
Fan, Rui
Dudek, Piotr
Mayol-Cuevas, Walterio
On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays
title On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays
title_full On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays
title_fullStr On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays
title_full_unstemmed On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays
title_short On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays
title_sort on-sensor binarized cnn inference with dynamic model swapping in pixel processor arrays
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9421154/
https://www.ncbi.nlm.nih.gov/pubmed/36046469
http://dx.doi.org/10.3389/fnins.2022.909448
work_keys_str_mv AT liuyanan onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays
AT boselaurie onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays
AT fanrui onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays
AT dudekpiotr onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays
AT mayolcuevaswalterio onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays