Cargando…

SRAM-Based CIM Architecture Design for Event Detection †

Convolutional neural networks (CNNs) play a key role in deep learning applications. However, the high computational complexity and high-energy consumption of CNNs trammel their application in hardware accelerators. Computing-in-memory (CIM) is the technique of running calculations entirely in memory...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sulaiman, Muhammad Bintang Gemintang, Lin, Jin-Yu, Li, Jian-Bai, Shih, Cheng-Ming, Juang, Kai-Cheung, Lu, Chih-Cheng
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610894/ https://www.ncbi.nlm.nih.gov/pubmed/36298205 http://dx.doi.org/10.3390/s22207854

_version_	1784819391327633408
author	Sulaiman, Muhammad Bintang Gemintang Lin, Jin-Yu Li, Jian-Bai Shih, Cheng-Ming Juang, Kai-Cheung Lu, Chih-Cheng
author_facet	Sulaiman, Muhammad Bintang Gemintang Lin, Jin-Yu Li, Jian-Bai Shih, Cheng-Ming Juang, Kai-Cheung Lu, Chih-Cheng
author_sort	Sulaiman, Muhammad Bintang Gemintang
collection	PubMed
description	Convolutional neural networks (CNNs) play a key role in deep learning applications. However, the high computational complexity and high-energy consumption of CNNs trammel their application in hardware accelerators. Computing-in-memory (CIM) is the technique of running calculations entirely in memory (in our design, we use SRAM). CIM architecture has demonstrated great potential to effectively compute large-scale matrix-vector multiplication. CIM-based architecture for event detection is designed to trigger the next stage of precision inference. To implement an SRAM-based CIM accelerator, a software and hardware co-design approach must consider the CIM macro’s hardware limitations to map the weight onto the AI edge devices. In this paper, we designed a hierarchical AI architecture to optimize the end-to-end system power in the AIoT application. In the experiment, the CIM-aware algorithm with 4-bit activation and 8-bit weight is examined on hand gesture and CIFAR-10 datasets, and determined to have 99.70% and 70.58% accuracy, respectively. A profiling tool to analyze the proposed design is also developed to measure how efficient our architecture design is. The proposed design system utilizes the operating frequency of 100 MHz, hand gesture and CIFAR-10 as the datasets, and nine CNNs and one FC layer as its network, resulting in a frame rate of 662 FPS, 37.6% processing unit utilization, and a power consumption of 0.853 mW.
format	Online Article Text
id	pubmed-9610894
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-96108942022-10-28 SRAM-Based CIM Architecture Design for Event Detection † Sulaiman, Muhammad Bintang Gemintang Lin, Jin-Yu Li, Jian-Bai Shih, Cheng-Ming Juang, Kai-Cheung Lu, Chih-Cheng Sensors (Basel) Article Convolutional neural networks (CNNs) play a key role in deep learning applications. However, the high computational complexity and high-energy consumption of CNNs trammel their application in hardware accelerators. Computing-in-memory (CIM) is the technique of running calculations entirely in memory (in our design, we use SRAM). CIM architecture has demonstrated great potential to effectively compute large-scale matrix-vector multiplication. CIM-based architecture for event detection is designed to trigger the next stage of precision inference. To implement an SRAM-based CIM accelerator, a software and hardware co-design approach must consider the CIM macro’s hardware limitations to map the weight onto the AI edge devices. In this paper, we designed a hierarchical AI architecture to optimize the end-to-end system power in the AIoT application. In the experiment, the CIM-aware algorithm with 4-bit activation and 8-bit weight is examined on hand gesture and CIFAR-10 datasets, and determined to have 99.70% and 70.58% accuracy, respectively. A profiling tool to analyze the proposed design is also developed to measure how efficient our architecture design is. The proposed design system utilizes the operating frequency of 100 MHz, hand gesture and CIFAR-10 as the datasets, and nine CNNs and one FC layer as its network, resulting in a frame rate of 662 FPS, 37.6% processing unit utilization, and a power consumption of 0.853 mW. MDPI 2022-10-16 /pmc/articles/PMC9610894/ /pubmed/36298205 http://dx.doi.org/10.3390/s22207854 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Sulaiman, Muhammad Bintang Gemintang Lin, Jin-Yu Li, Jian-Bai Shih, Cheng-Ming Juang, Kai-Cheung Lu, Chih-Cheng SRAM-Based CIM Architecture Design for Event Detection †
title	SRAM-Based CIM Architecture Design for Event Detection †
title_full	SRAM-Based CIM Architecture Design for Event Detection †
title_fullStr	SRAM-Based CIM Architecture Design for Event Detection †
title_full_unstemmed	SRAM-Based CIM Architecture Design for Event Detection †
title_short	SRAM-Based CIM Architecture Design for Event Detection †
title_sort	sram-based cim architecture design for event detection †
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610894/ https://www.ncbi.nlm.nih.gov/pubmed/36298205 http://dx.doi.org/10.3390/s22207854
work_keys_str_mv	AT sulaimanmuhammadbintanggemintang srambasedcimarchitecturedesignforeventdetection AT linjinyu srambasedcimarchitecturedesignforeventdetection AT lijianbai srambasedcimarchitecturedesignforeventdetection AT shihchengming srambasedcimarchitecturedesignforeventdetection AT juangkaicheung srambasedcimarchitecturedesignforeventdetection AT luchihcheng srambasedcimarchitecturedesignforeventdetection

SRAM-Based CIM Architecture Design for Event Detection †

Ejemplares similares