Cargando…
Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
Tiny machine learning (TinyML) has become an emerging field according to the rapid growth in the area of the internet of things (IoT). However, most deep learning algorithms are too complex, require a lot of memory to store data, and consume an enormous amount of energy for calculation/data movement...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919609/ https://www.ncbi.nlm.nih.gov/pubmed/36772225 http://dx.doi.org/10.3390/s23031185 |
_version_ | 1784886864942989312 |
---|---|
author | Kim, Kyungho Jang, Sung-Joon Park, Jonghee Lee, Eunchong Lee, Sang-Seol |
author_facet | Kim, Kyungho Jang, Sung-Joon Park, Jonghee Lee, Eunchong Lee, Sang-Seol |
author_sort | Kim, Kyungho |
collection | PubMed |
description | Tiny machine learning (TinyML) has become an emerging field according to the rapid growth in the area of the internet of things (IoT). However, most deep learning algorithms are too complex, require a lot of memory to store data, and consume an enormous amount of energy for calculation/data movement; therefore, the algorithms are not suitable for IoT devices such as various sensors and imaging systems. Furthermore, typical hardware accelerators cannot be embedded in these resource-constrained edge devices, and they are difficult to drive real-time inference processing as well. To perform the real-time processing on these battery-operated devices, deep learning models should be compact and hardware-optimized, and hardware accelerator designs also have to be lightweight and consume extremely low energy. Therefore, we present an optimized network model through model simplification and compression for the hardware to be implemented, and propose a hardware architecture for a lightweight and energy-efficient deep learning accelerator. The experimental results demonstrate that our optimized model successfully performs object detection, and the proposed hardware design achieves 1.25× and 4.27× smaller logic and BRAM size, respectively, and its energy consumption is approximately 10.37× lower than previous similar works with 43.95 fps as a real-time process under an operating frequency of 100 MHz on a Xilinx ZC702 FPGA. |
format | Online Article Text |
id | pubmed-9919609 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-99196092023-02-12 Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices Kim, Kyungho Jang, Sung-Joon Park, Jonghee Lee, Eunchong Lee, Sang-Seol Sensors (Basel) Article Tiny machine learning (TinyML) has become an emerging field according to the rapid growth in the area of the internet of things (IoT). However, most deep learning algorithms are too complex, require a lot of memory to store data, and consume an enormous amount of energy for calculation/data movement; therefore, the algorithms are not suitable for IoT devices such as various sensors and imaging systems. Furthermore, typical hardware accelerators cannot be embedded in these resource-constrained edge devices, and they are difficult to drive real-time inference processing as well. To perform the real-time processing on these battery-operated devices, deep learning models should be compact and hardware-optimized, and hardware accelerator designs also have to be lightweight and consume extremely low energy. Therefore, we present an optimized network model through model simplification and compression for the hardware to be implemented, and propose a hardware architecture for a lightweight and energy-efficient deep learning accelerator. The experimental results demonstrate that our optimized model successfully performs object detection, and the proposed hardware design achieves 1.25× and 4.27× smaller logic and BRAM size, respectively, and its energy consumption is approximately 10.37× lower than previous similar works with 43.95 fps as a real-time process under an operating frequency of 100 MHz on a Xilinx ZC702 FPGA. MDPI 2023-01-20 /pmc/articles/PMC9919609/ /pubmed/36772225 http://dx.doi.org/10.3390/s23031185 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kim, Kyungho Jang, Sung-Joon Park, Jonghee Lee, Eunchong Lee, Sang-Seol Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices |
title | Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices |
title_full | Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices |
title_fullStr | Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices |
title_full_unstemmed | Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices |
title_short | Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices |
title_sort | lightweight and energy-efficient deep learning accelerator for real-time object detection on edge devices |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919609/ https://www.ncbi.nlm.nih.gov/pubmed/36772225 http://dx.doi.org/10.3390/s23031185 |
work_keys_str_mv | AT kimkyungho lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices AT jangsungjoon lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices AT parkjonghee lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices AT leeeunchong lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices AT leesangseol lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices |