Cargando…

Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices

Tiny machine learning (TinyML) has become an emerging field according to the rapid growth in the area of the internet of things (IoT). However, most deep learning algorithms are too complex, require a lot of memory to store data, and consume an enormous amount of energy for calculation/data movement...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kim, Kyungho, Jang, Sung-Joon, Park, Jonghee, Lee, Eunchong, Lee, Sang-Seol
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919609/ https://www.ncbi.nlm.nih.gov/pubmed/36772225 http://dx.doi.org/10.3390/s23031185

_version_	1784886864942989312
author	Kim, Kyungho Jang, Sung-Joon Park, Jonghee Lee, Eunchong Lee, Sang-Seol
author_facet	Kim, Kyungho Jang, Sung-Joon Park, Jonghee Lee, Eunchong Lee, Sang-Seol
author_sort	Kim, Kyungho
collection	PubMed
description	Tiny machine learning (TinyML) has become an emerging field according to the rapid growth in the area of the internet of things (IoT). However, most deep learning algorithms are too complex, require a lot of memory to store data, and consume an enormous amount of energy for calculation/data movement; therefore, the algorithms are not suitable for IoT devices such as various sensors and imaging systems. Furthermore, typical hardware accelerators cannot be embedded in these resource-constrained edge devices, and they are difficult to drive real-time inference processing as well. To perform the real-time processing on these battery-operated devices, deep learning models should be compact and hardware-optimized, and hardware accelerator designs also have to be lightweight and consume extremely low energy. Therefore, we present an optimized network model through model simplification and compression for the hardware to be implemented, and propose a hardware architecture for a lightweight and energy-efficient deep learning accelerator. The experimental results demonstrate that our optimized model successfully performs object detection, and the proposed hardware design achieves 1.25× and 4.27× smaller logic and BRAM size, respectively, and its energy consumption is approximately 10.37× lower than previous similar works with 43.95 fps as a real-time process under an operating frequency of 100 MHz on a Xilinx ZC702 FPGA.
format	Online Article Text
id	pubmed-9919609
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-99196092023-02-12 Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices Kim, Kyungho Jang, Sung-Joon Park, Jonghee Lee, Eunchong Lee, Sang-Seol Sensors (Basel) Article Tiny machine learning (TinyML) has become an emerging field according to the rapid growth in the area of the internet of things (IoT). However, most deep learning algorithms are too complex, require a lot of memory to store data, and consume an enormous amount of energy for calculation/data movement; therefore, the algorithms are not suitable for IoT devices such as various sensors and imaging systems. Furthermore, typical hardware accelerators cannot be embedded in these resource-constrained edge devices, and they are difficult to drive real-time inference processing as well. To perform the real-time processing on these battery-operated devices, deep learning models should be compact and hardware-optimized, and hardware accelerator designs also have to be lightweight and consume extremely low energy. Therefore, we present an optimized network model through model simplification and compression for the hardware to be implemented, and propose a hardware architecture for a lightweight and energy-efficient deep learning accelerator. The experimental results demonstrate that our optimized model successfully performs object detection, and the proposed hardware design achieves 1.25× and 4.27× smaller logic and BRAM size, respectively, and its energy consumption is approximately 10.37× lower than previous similar works with 43.95 fps as a real-time process under an operating frequency of 100 MHz on a Xilinx ZC702 FPGA. MDPI 2023-01-20 /pmc/articles/PMC9919609/ /pubmed/36772225 http://dx.doi.org/10.3390/s23031185 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Kim, Kyungho Jang, Sung-Joon Park, Jonghee Lee, Eunchong Lee, Sang-Seol Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title	Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_full	Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_fullStr	Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_full_unstemmed	Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_short	Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices
title_sort	lightweight and energy-efficient deep learning accelerator for real-time object detection on edge devices
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919609/ https://www.ncbi.nlm.nih.gov/pubmed/36772225 http://dx.doi.org/10.3390/s23031185
work_keys_str_mv	AT kimkyungho lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices AT jangsungjoon lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices AT parkjonghee lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices AT leeeunchong lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices AT leesangseol lightweightandenergyefficientdeeplearningacceleratorforrealtimeobjectdetectiononedgedevices

Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices

Ejemplares similares