Cargando…

SRAM-Based CIM Architecture Design for Event Detection †

Convolutional neural networks (CNNs) play a key role in deep learning applications. However, the high computational complexity and high-energy consumption of CNNs trammel their application in hardware accelerators. Computing-in-memory (CIM) is the technique of running calculations entirely in memory...

Descripción completa

Detalles Bibliográficos
Autores principales: Sulaiman, Muhammad Bintang Gemintang, Lin, Jin-Yu, Li, Jian-Bai, Shih, Cheng-Ming, Juang, Kai-Cheung, Lu, Chih-Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610894/
https://www.ncbi.nlm.nih.gov/pubmed/36298205
http://dx.doi.org/10.3390/s22207854
_version_ 1784819391327633408
author Sulaiman, Muhammad Bintang Gemintang
Lin, Jin-Yu
Li, Jian-Bai
Shih, Cheng-Ming
Juang, Kai-Cheung
Lu, Chih-Cheng
author_facet Sulaiman, Muhammad Bintang Gemintang
Lin, Jin-Yu
Li, Jian-Bai
Shih, Cheng-Ming
Juang, Kai-Cheung
Lu, Chih-Cheng
author_sort Sulaiman, Muhammad Bintang Gemintang
collection PubMed
description Convolutional neural networks (CNNs) play a key role in deep learning applications. However, the high computational complexity and high-energy consumption of CNNs trammel their application in hardware accelerators. Computing-in-memory (CIM) is the technique of running calculations entirely in memory (in our design, we use SRAM). CIM architecture has demonstrated great potential to effectively compute large-scale matrix-vector multiplication. CIM-based architecture for event detection is designed to trigger the next stage of precision inference. To implement an SRAM-based CIM accelerator, a software and hardware co-design approach must consider the CIM macro’s hardware limitations to map the weight onto the AI edge devices. In this paper, we designed a hierarchical AI architecture to optimize the end-to-end system power in the AIoT application. In the experiment, the CIM-aware algorithm with 4-bit activation and 8-bit weight is examined on hand gesture and CIFAR-10 datasets, and determined to have 99.70% and 70.58% accuracy, respectively. A profiling tool to analyze the proposed design is also developed to measure how efficient our architecture design is. The proposed design system utilizes the operating frequency of 100 MHz, hand gesture and CIFAR-10 as the datasets, and nine CNNs and one FC layer as its network, resulting in a frame rate of 662 FPS, 37.6% processing unit utilization, and a power consumption of 0.853 mW.
format Online
Article
Text
id pubmed-9610894
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96108942022-10-28 SRAM-Based CIM Architecture Design for Event Detection † Sulaiman, Muhammad Bintang Gemintang Lin, Jin-Yu Li, Jian-Bai Shih, Cheng-Ming Juang, Kai-Cheung Lu, Chih-Cheng Sensors (Basel) Article Convolutional neural networks (CNNs) play a key role in deep learning applications. However, the high computational complexity and high-energy consumption of CNNs trammel their application in hardware accelerators. Computing-in-memory (CIM) is the technique of running calculations entirely in memory (in our design, we use SRAM). CIM architecture has demonstrated great potential to effectively compute large-scale matrix-vector multiplication. CIM-based architecture for event detection is designed to trigger the next stage of precision inference. To implement an SRAM-based CIM accelerator, a software and hardware co-design approach must consider the CIM macro’s hardware limitations to map the weight onto the AI edge devices. In this paper, we designed a hierarchical AI architecture to optimize the end-to-end system power in the AIoT application. In the experiment, the CIM-aware algorithm with 4-bit activation and 8-bit weight is examined on hand gesture and CIFAR-10 datasets, and determined to have 99.70% and 70.58% accuracy, respectively. A profiling tool to analyze the proposed design is also developed to measure how efficient our architecture design is. The proposed design system utilizes the operating frequency of 100 MHz, hand gesture and CIFAR-10 as the datasets, and nine CNNs and one FC layer as its network, resulting in a frame rate of 662 FPS, 37.6% processing unit utilization, and a power consumption of 0.853 mW. MDPI 2022-10-16 /pmc/articles/PMC9610894/ /pubmed/36298205 http://dx.doi.org/10.3390/s22207854 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sulaiman, Muhammad Bintang Gemintang
Lin, Jin-Yu
Li, Jian-Bai
Shih, Cheng-Ming
Juang, Kai-Cheung
Lu, Chih-Cheng
SRAM-Based CIM Architecture Design for Event Detection †
title SRAM-Based CIM Architecture Design for Event Detection †
title_full SRAM-Based CIM Architecture Design for Event Detection †
title_fullStr SRAM-Based CIM Architecture Design for Event Detection †
title_full_unstemmed SRAM-Based CIM Architecture Design for Event Detection †
title_short SRAM-Based CIM Architecture Design for Event Detection †
title_sort sram-based cim architecture design for event detection †
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610894/
https://www.ncbi.nlm.nih.gov/pubmed/36298205
http://dx.doi.org/10.3390/s22207854
work_keys_str_mv AT sulaimanmuhammadbintanggemintang srambasedcimarchitecturedesignforeventdetection
AT linjinyu srambasedcimarchitecturedesignforeventdetection
AT lijianbai srambasedcimarchitecturedesignforeventdetection
AT shihchengming srambasedcimarchitecturedesignforeventdetection
AT juangkaicheung srambasedcimarchitecturedesignforeventdetection
AT luchihcheng srambasedcimarchitecturedesignforeventdetection