Cargando…

Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices

Tiny machine learning (TinyML) has become an emerging field according to the rapid growth in the area of the internet of things (IoT). However, most deep learning algorithms are too complex, require a lot of memory to store data, and consume an enormous amount of energy for calculation/data movement...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Kyungho, Jang, Sung-Joon, Park, Jonghee, Lee, Eunchong, Lee, Sang-Seol
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919609/
https://www.ncbi.nlm.nih.gov/pubmed/36772225
http://dx.doi.org/10.3390/s23031185
_version_ 1784886864942989312
author Kim, Kyungho
Jang, Sung-Joon
Park, Jonghee
Lee, Eunchong
Lee, Sang-Seol
author_facet Kim, Kyungho
Jang, Sung-Joon
Park, Jonghee
Lee, Eunchong
Lee, Sang-Seol
author_sort Kim, Kyungho
collection PubMed
description Tiny machine learning (TinyML) has become an emerging field according to the rapid growth in the area of the internet of things (IoT). However, most deep learning algorithms are too complex, require a lot of memory to store data, and consume an enormous amount of energy for calculation/data movement; therefore, the algorithms are not suitable for IoT devices such as various sensors and imaging systems. Furthermore, typical hardware accelerators cannot be embedded in these resource-constrained edge devices, and they are difficult to drive real-time inference processing as well. To perform the real-time processing on these battery-operated devices, deep learning models should be compact and hardware-optimized, and hardware accelerator designs also have to be lightweight and consume extremely low energy. Therefore, we present an optimized network model through model simplification and compression for the hardware to be implemented, and propose a hardware architecture for a lightweight and energy-efficient deep learning accelerator. The experimental results demonstrate that our optimized model successfully performs object detection, and the proposed hardware design achieves 1.25× and 4.27× smaller logic and BRAM size, respectively, and its energy consumption is approximately 10.37× lower than previous similar works with 43.95 fps as a real-time process under an operating frequency of 100 MHz on a Xilinx ZC702 FPGA.
format Online
Article
Text
id pubmed-9919609
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99196092023-02-12 Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices Kim, Kyungho Jang, Sung-Joon Park, Jonghee Lee, Eunchong Lee, Sang-Seol Sensors (Basel) Article Tiny machine learning (TinyML) has become an emerging field according to the rapid growth in the area of the internet of things (IoT). However, most deep learning algorithms are too complex, require a lot of memory to store data, and consume an enormous amount of energy for calculation/data movement; therefore, the algorithms are not suitable for IoT devices such as various sensors and imaging systems. Furthermore, typical hardware accelerators cannot be embedded in these resource-constrained edge devices, and they are difficult to drive real-time inference processing as well. To perform the real-time processing on these battery-operated devices, deep learning models should be compact and hardware-optimized, and hardware accelerator designs also have to be lightweight and consume extremely low energy. Therefore, we present an optimized network model through model simplification and compression for the hardware to be implemented, and propose a hardware architecture for a lightweight and energy-efficient deep learning accelerator. The experimental results demonstrate that our optimized model successfully performs object detection, and the proposed hardware design achieves 1.25× and 4.27× smaller logic and BRAM size, respectively, and its energy consumption is approximately 10.37× lower than previous similar works with 43.95 fps as a real-time process under an operating frequency of 100 MHz on a Xilinx ZC702 FPGA. MDPI 2023-01-20 /pmc/articles/PMC9919609/ /pubmed/36772225 http://dx.doi.org/10.3390/s23031185 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kim, Kyungho
Jang, Sung-Joon
Park, Jonghee
Lee, Eunchong
Lee, Sang-Seol
Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_full Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_fullStr Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_full_unstemmed Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_short Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_sort lightweight and energy-efficient deep learning accelerator for real-time object detection on edge devices
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919609/
https://www.ncbi.nlm.nih.gov/pubmed/36772225
http://dx.doi.org/10.3390/s23031185
work_keys_str_mv AT kimkyungho lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices
AT jangsungjoon lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices
AT parkjonghee lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices
AT leeeunchong lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices
AT leesangseol lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices