Cargando…

Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing †

The astounding development of optical sensing imaging technology, coupled with the impressive improvements in machine learning algorithms, has increased our ability to understand and extract information from scenic events. In most cases, Convolution neural networks (CNNs) are largely adopted to infe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pantho, Md Jubaer Hossain, Bhowmik, Pankaj, Bobda, Christophe
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8001538/ https://www.ncbi.nlm.nih.gov/pubmed/33802235 http://dx.doi.org/10.3390/s21061955

_version_	1783671253549187072
author	Pantho, Md Jubaer Hossain Bhowmik, Pankaj Bobda, Christophe
author_facet	Pantho, Md Jubaer Hossain Bhowmik, Pankaj Bobda, Christophe
author_sort	Pantho, Md Jubaer Hossain
collection	PubMed
description	The astounding development of optical sensing imaging technology, coupled with the impressive improvements in machine learning algorithms, has increased our ability to understand and extract information from scenic events. In most cases, Convolution neural networks (CNNs) are largely adopted to infer knowledge due to their surprising success in automation, surveillance, and many other application domains. However, the convolution operations’ overwhelming computation demand has somewhat limited their use in remote sensing edge devices. In these platforms, real-time processing remains a challenging task due to the tight constraints on resources and power. Here, the transfer and processing of non-relevant image pixels act as a bottleneck on the entire system. It is possible to overcome this bottleneck by exploiting the high bandwidth available at the sensor interface by designing a CNN inference architecture near the sensor. This paper presents an attention-based pixel processing architecture to facilitate the CNN inference near the image sensor. We propose an efficient computation method to reduce the dynamic power by decreasing the overall computation of the convolution operations. The proposed method reduces redundancies by using a hierarchical optimization approach. The approach minimizes power consumption for convolution operations by exploiting the Spatio-temporal redundancies found in the incoming feature maps and performs computations only on selected regions based on their relevance score. The proposed design addresses problems related to the mapping of computations onto an array of processing elements (PEs) and introduces a suitable network structure for communication. The PEs are highly optimized to provide low latency and power for CNN applications. While designing the model, we exploit the concepts of biological vision systems to reduce computation and energy. We prototype the model in a Virtex UltraScale+ FPGA and implement it in Application Specific Integrated Circuit (ASIC) using the TSMC 90nm technology library. The results suggest that the proposed architecture significantly reduces dynamic power consumption and achieves high-speed up surpassing existing embedded processors’ computational capabilities.
format	Online Article Text
id	pubmed-8001538
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-80015382021-03-28 Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing † Pantho, Md Jubaer Hossain Bhowmik, Pankaj Bobda, Christophe Sensors (Basel) Article The astounding development of optical sensing imaging technology, coupled with the impressive improvements in machine learning algorithms, has increased our ability to understand and extract information from scenic events. In most cases, Convolution neural networks (CNNs) are largely adopted to infer knowledge due to their surprising success in automation, surveillance, and many other application domains. However, the convolution operations’ overwhelming computation demand has somewhat limited their use in remote sensing edge devices. In these platforms, real-time processing remains a challenging task due to the tight constraints on resources and power. Here, the transfer and processing of non-relevant image pixels act as a bottleneck on the entire system. It is possible to overcome this bottleneck by exploiting the high bandwidth available at the sensor interface by designing a CNN inference architecture near the sensor. This paper presents an attention-based pixel processing architecture to facilitate the CNN inference near the image sensor. We propose an efficient computation method to reduce the dynamic power by decreasing the overall computation of the convolution operations. The proposed method reduces redundancies by using a hierarchical optimization approach. The approach minimizes power consumption for convolution operations by exploiting the Spatio-temporal redundancies found in the incoming feature maps and performs computations only on selected regions based on their relevance score. The proposed design addresses problems related to the mapping of computations onto an array of processing elements (PEs) and introduces a suitable network structure for communication. The PEs are highly optimized to provide low latency and power for CNN applications. While designing the model, we exploit the concepts of biological vision systems to reduce computation and energy. We prototype the model in a Virtex UltraScale+ FPGA and implement it in Application Specific Integrated Circuit (ASIC) using the TSMC 90nm technology library. The results suggest that the proposed architecture significantly reduces dynamic power consumption and achieves high-speed up surpassing existing embedded processors’ computational capabilities. MDPI 2021-03-10 /pmc/articles/PMC8001538/ /pubmed/33802235 http://dx.doi.org/10.3390/s21061955 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Pantho, Md Jubaer Hossain Bhowmik, Pankaj Bobda, Christophe Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing †
title	Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing †
title_full	Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing †
title_fullStr	Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing †
title_full_unstemmed	Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing †
title_short	Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing †
title_sort	towards an efficient cnn inference architecture enabling in-sensor processing †
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8001538/ https://www.ncbi.nlm.nih.gov/pubmed/33802235 http://dx.doi.org/10.3390/s21061955
work_keys_str_mv	AT panthomdjubaerhossain towardsanefficientcnninferencearchitectureenablinginsensorprocessing AT bhowmikpankaj towardsanefficientcnninferencearchitectureenablinginsensorprocessing AT bobdachristophe towardsanefficientcnninferencearchitectureenablinginsensorprocessing

Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing †

Ejemplares similares