Cargando…

Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing

This paper presents a method for simplifying and quantizing a deep neural network (DNN)-based object detector to embed it into a real-time edge device. For network simplification, this paper compares five methods for applying channel pruning to a residual block because special care must be taken reg...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Kyoungtaek, Wi, Seong Min, Jung, Ho Gi, Suhr, Jae Kyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099208/
https://www.ncbi.nlm.nih.gov/pubmed/37050837
http://dx.doi.org/10.3390/s23073777
_version_ 1785025004251906048
author Choi, Kyoungtaek
Wi, Seong Min
Jung, Ho Gi
Suhr, Jae Kyu
author_facet Choi, Kyoungtaek
Wi, Seong Min
Jung, Ho Gi
Suhr, Jae Kyu
author_sort Choi, Kyoungtaek
collection PubMed
description This paper presents a method for simplifying and quantizing a deep neural network (DNN)-based object detector to embed it into a real-time edge device. For network simplification, this paper compares five methods for applying channel pruning to a residual block because special care must be taken regarding the number of channels when summing two feature maps. Based on the comparison in terms of detection performance, parameter number, computational complexity, and processing time, this paper discovers the most satisfying method on the edge device. For network quantization, this paper compares post-training quantization (PTQ) and quantization-aware training (QAT) using two datasets with different detection difficulties. This comparison shows that both approaches are recommended in the case of the easy-to-detect dataset, but QAT is preferable in the case of the difficult-to-detect dataset. Through experiments, this paper shows that the proposed method can effectively embed the DNN-based object detector into an edge device equipped with Qualcomm’s QCS605 System-on-Chip (SoC), while achieving a real-time operation with more than 10 frames per second.
format Online
Article
Text
id pubmed-10099208
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100992082023-04-14 Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing Choi, Kyoungtaek Wi, Seong Min Jung, Ho Gi Suhr, Jae Kyu Sensors (Basel) Article This paper presents a method for simplifying and quantizing a deep neural network (DNN)-based object detector to embed it into a real-time edge device. For network simplification, this paper compares five methods for applying channel pruning to a residual block because special care must be taken regarding the number of channels when summing two feature maps. Based on the comparison in terms of detection performance, parameter number, computational complexity, and processing time, this paper discovers the most satisfying method on the edge device. For network quantization, this paper compares post-training quantization (PTQ) and quantization-aware training (QAT) using two datasets with different detection difficulties. This comparison shows that both approaches are recommended in the case of the easy-to-detect dataset, but QAT is preferable in the case of the difficult-to-detect dataset. Through experiments, this paper shows that the proposed method can effectively embed the DNN-based object detector into an edge device equipped with Qualcomm’s QCS605 System-on-Chip (SoC), while achieving a real-time operation with more than 10 frames per second. MDPI 2023-04-06 /pmc/articles/PMC10099208/ /pubmed/37050837 http://dx.doi.org/10.3390/s23073777 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Choi, Kyoungtaek
Wi, Seong Min
Jung, Ho Gi
Suhr, Jae Kyu
Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing
title Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing
title_full Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing
title_fullStr Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing
title_full_unstemmed Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing
title_short Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing
title_sort simplification of deep neural network-based object detector for real-time edge computing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099208/
https://www.ncbi.nlm.nih.gov/pubmed/37050837
http://dx.doi.org/10.3390/s23073777
work_keys_str_mv AT choikyoungtaek simplificationofdeepneuralnetworkbasedobjectdetectorforrealtimeedgecomputing
AT wiseongmin simplificationofdeepneuralnetworkbasedobjectdetectorforrealtimeedgecomputing
AT junghogi simplificationofdeepneuralnetworkbasedobjectdetectorforrealtimeedgecomputing
AT suhrjaekyu simplificationofdeepneuralnetworkbasedobjectdetectorforrealtimeedgecomputing