Cargando…
On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays
Many types of Convolutional Neural Network (CNN) models and training methods have been proposed in recent years aiming to provide efficiency for embedded and edge devices with limited computation and memory resources. The wide variety of architectures makes this a complex task that has to balance ge...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9421154/ https://www.ncbi.nlm.nih.gov/pubmed/36046469 http://dx.doi.org/10.3389/fnins.2022.909448 |
_version_ | 1784777534627381248 |
---|---|
author | Liu, Yanan Bose, Laurie Fan, Rui Dudek, Piotr Mayol-Cuevas, Walterio |
author_facet | Liu, Yanan Bose, Laurie Fan, Rui Dudek, Piotr Mayol-Cuevas, Walterio |
author_sort | Liu, Yanan |
collection | PubMed |
description | Many types of Convolutional Neural Network (CNN) models and training methods have been proposed in recent years aiming to provide efficiency for embedded and edge devices with limited computation and memory resources. The wide variety of architectures makes this a complex task that has to balance generality with efficiency. Among the most interesting camera-sensor architectures are Pixel Processor Arrays (PPAs). This study presents two methods that are useful for embedded CNNs in general but particularly suitable for PPAs. The first is for training purely binarized CNNs, the second is for deploying larger models with a model swapping paradigm that loads model components dynamically. Specifically, this study trains and implements networks with batch normalization and adaptive threshold for binary activations. Then, we convert batch normalization and binary activations into a bias matrix which can be parallelly implemented by an add/sub operation. For dynamic model swapping, we propose to decompose applications that are beyond the capacity of a PPA into sub-tasks that can be solved by tree networks that can be loaded dynamically as needed. We demonstrate our approaches to various tasks including classification, localization, and coarse segmentation on a highly resource constrained PPA sensor-processor. |
format | Online Article Text |
id | pubmed-9421154 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-94211542022-08-30 On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays Liu, Yanan Bose, Laurie Fan, Rui Dudek, Piotr Mayol-Cuevas, Walterio Front Neurosci Neuroscience Many types of Convolutional Neural Network (CNN) models and training methods have been proposed in recent years aiming to provide efficiency for embedded and edge devices with limited computation and memory resources. The wide variety of architectures makes this a complex task that has to balance generality with efficiency. Among the most interesting camera-sensor architectures are Pixel Processor Arrays (PPAs). This study presents two methods that are useful for embedded CNNs in general but particularly suitable for PPAs. The first is for training purely binarized CNNs, the second is for deploying larger models with a model swapping paradigm that loads model components dynamically. Specifically, this study trains and implements networks with batch normalization and adaptive threshold for binary activations. Then, we convert batch normalization and binary activations into a bias matrix which can be parallelly implemented by an add/sub operation. For dynamic model swapping, we propose to decompose applications that are beyond the capacity of a PPA into sub-tasks that can be solved by tree networks that can be loaded dynamically as needed. We demonstrate our approaches to various tasks including classification, localization, and coarse segmentation on a highly resource constrained PPA sensor-processor. Frontiers Media S.A. 2022-08-15 /pmc/articles/PMC9421154/ /pubmed/36046469 http://dx.doi.org/10.3389/fnins.2022.909448 Text en Copyright © 2022 Liu, Bose, Fan, Dudek and Mayol-Cuevas. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Liu, Yanan Bose, Laurie Fan, Rui Dudek, Piotr Mayol-Cuevas, Walterio On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays |
title | On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays |
title_full | On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays |
title_fullStr | On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays |
title_full_unstemmed | On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays |
title_short | On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays |
title_sort | on-sensor binarized cnn inference with dynamic model swapping in pixel processor arrays |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9421154/ https://www.ncbi.nlm.nih.gov/pubmed/36046469 http://dx.doi.org/10.3389/fnins.2022.909448 |
work_keys_str_mv | AT liuyanan onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays AT boselaurie onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays AT fanrui onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays AT dudekpiotr onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays AT mayolcuevaswalterio onsensorbinarizedcnninferencewithdynamicmodelswappinginpixelprocessorarrays |