Cargando…
SRAM-Based CIM Architecture Design for Event Detection †
Convolutional neural networks (CNNs) play a key role in deep learning applications. However, the high computational complexity and high-energy consumption of CNNs trammel their application in hardware accelerators. Computing-in-memory (CIM) is the technique of running calculations entirely in memory...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610894/ https://www.ncbi.nlm.nih.gov/pubmed/36298205 http://dx.doi.org/10.3390/s22207854 |
_version_ | 1784819391327633408 |
---|---|
author | Sulaiman, Muhammad Bintang Gemintang Lin, Jin-Yu Li, Jian-Bai Shih, Cheng-Ming Juang, Kai-Cheung Lu, Chih-Cheng |
author_facet | Sulaiman, Muhammad Bintang Gemintang Lin, Jin-Yu Li, Jian-Bai Shih, Cheng-Ming Juang, Kai-Cheung Lu, Chih-Cheng |
author_sort | Sulaiman, Muhammad Bintang Gemintang |
collection | PubMed |
description | Convolutional neural networks (CNNs) play a key role in deep learning applications. However, the high computational complexity and high-energy consumption of CNNs trammel their application in hardware accelerators. Computing-in-memory (CIM) is the technique of running calculations entirely in memory (in our design, we use SRAM). CIM architecture has demonstrated great potential to effectively compute large-scale matrix-vector multiplication. CIM-based architecture for event detection is designed to trigger the next stage of precision inference. To implement an SRAM-based CIM accelerator, a software and hardware co-design approach must consider the CIM macro’s hardware limitations to map the weight onto the AI edge devices. In this paper, we designed a hierarchical AI architecture to optimize the end-to-end system power in the AIoT application. In the experiment, the CIM-aware algorithm with 4-bit activation and 8-bit weight is examined on hand gesture and CIFAR-10 datasets, and determined to have 99.70% and 70.58% accuracy, respectively. A profiling tool to analyze the proposed design is also developed to measure how efficient our architecture design is. The proposed design system utilizes the operating frequency of 100 MHz, hand gesture and CIFAR-10 as the datasets, and nine CNNs and one FC layer as its network, resulting in a frame rate of 662 FPS, 37.6% processing unit utilization, and a power consumption of 0.853 mW. |
format | Online Article Text |
id | pubmed-9610894 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96108942022-10-28 SRAM-Based CIM Architecture Design for Event Detection † Sulaiman, Muhammad Bintang Gemintang Lin, Jin-Yu Li, Jian-Bai Shih, Cheng-Ming Juang, Kai-Cheung Lu, Chih-Cheng Sensors (Basel) Article Convolutional neural networks (CNNs) play a key role in deep learning applications. However, the high computational complexity and high-energy consumption of CNNs trammel their application in hardware accelerators. Computing-in-memory (CIM) is the technique of running calculations entirely in memory (in our design, we use SRAM). CIM architecture has demonstrated great potential to effectively compute large-scale matrix-vector multiplication. CIM-based architecture for event detection is designed to trigger the next stage of precision inference. To implement an SRAM-based CIM accelerator, a software and hardware co-design approach must consider the CIM macro’s hardware limitations to map the weight onto the AI edge devices. In this paper, we designed a hierarchical AI architecture to optimize the end-to-end system power in the AIoT application. In the experiment, the CIM-aware algorithm with 4-bit activation and 8-bit weight is examined on hand gesture and CIFAR-10 datasets, and determined to have 99.70% and 70.58% accuracy, respectively. A profiling tool to analyze the proposed design is also developed to measure how efficient our architecture design is. The proposed design system utilizes the operating frequency of 100 MHz, hand gesture and CIFAR-10 as the datasets, and nine CNNs and one FC layer as its network, resulting in a frame rate of 662 FPS, 37.6% processing unit utilization, and a power consumption of 0.853 mW. MDPI 2022-10-16 /pmc/articles/PMC9610894/ /pubmed/36298205 http://dx.doi.org/10.3390/s22207854 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Sulaiman, Muhammad Bintang Gemintang Lin, Jin-Yu Li, Jian-Bai Shih, Cheng-Ming Juang, Kai-Cheung Lu, Chih-Cheng SRAM-Based CIM Architecture Design for Event Detection † |
title | SRAM-Based CIM Architecture Design for Event Detection † |
title_full | SRAM-Based CIM Architecture Design for Event Detection † |
title_fullStr | SRAM-Based CIM Architecture Design for Event Detection † |
title_full_unstemmed | SRAM-Based CIM Architecture Design for Event Detection † |
title_short | SRAM-Based CIM Architecture Design for Event Detection † |
title_sort | sram-based cim architecture design for event detection † |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610894/ https://www.ncbi.nlm.nih.gov/pubmed/36298205 http://dx.doi.org/10.3390/s22207854 |
work_keys_str_mv | AT sulaimanmuhammadbintanggemintang srambasedcimarchitecturedesignforeventdetection AT linjinyu srambasedcimarchitecturedesignforeventdetection AT lijianbai srambasedcimarchitecturedesignforeventdetection AT shihchengming srambasedcimarchitecturedesignforeventdetection AT juangkaicheung srambasedcimarchitecturedesignforeventdetection AT luchihcheng srambasedcimarchitecturedesignforeventdetection |