Cargando…
Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing
This paper presents a method for simplifying and quantizing a deep neural network (DNN)-based object detector to embed it into a real-time edge device. For network simplification, this paper compares five methods for applying channel pruning to a residual block because special care must be taken reg...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099208/ https://www.ncbi.nlm.nih.gov/pubmed/37050837 http://dx.doi.org/10.3390/s23073777 |
_version_ | 1785025004251906048 |
---|---|
author | Choi, Kyoungtaek Wi, Seong Min Jung, Ho Gi Suhr, Jae Kyu |
author_facet | Choi, Kyoungtaek Wi, Seong Min Jung, Ho Gi Suhr, Jae Kyu |
author_sort | Choi, Kyoungtaek |
collection | PubMed |
description | This paper presents a method for simplifying and quantizing a deep neural network (DNN)-based object detector to embed it into a real-time edge device. For network simplification, this paper compares five methods for applying channel pruning to a residual block because special care must be taken regarding the number of channels when summing two feature maps. Based on the comparison in terms of detection performance, parameter number, computational complexity, and processing time, this paper discovers the most satisfying method on the edge device. For network quantization, this paper compares post-training quantization (PTQ) and quantization-aware training (QAT) using two datasets with different detection difficulties. This comparison shows that both approaches are recommended in the case of the easy-to-detect dataset, but QAT is preferable in the case of the difficult-to-detect dataset. Through experiments, this paper shows that the proposed method can effectively embed the DNN-based object detector into an edge device equipped with Qualcomm’s QCS605 System-on-Chip (SoC), while achieving a real-time operation with more than 10 frames per second. |
format | Online Article Text |
id | pubmed-10099208 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100992082023-04-14 Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing Choi, Kyoungtaek Wi, Seong Min Jung, Ho Gi Suhr, Jae Kyu Sensors (Basel) Article This paper presents a method for simplifying and quantizing a deep neural network (DNN)-based object detector to embed it into a real-time edge device. For network simplification, this paper compares five methods for applying channel pruning to a residual block because special care must be taken regarding the number of channels when summing two feature maps. Based on the comparison in terms of detection performance, parameter number, computational complexity, and processing time, this paper discovers the most satisfying method on the edge device. For network quantization, this paper compares post-training quantization (PTQ) and quantization-aware training (QAT) using two datasets with different detection difficulties. This comparison shows that both approaches are recommended in the case of the easy-to-detect dataset, but QAT is preferable in the case of the difficult-to-detect dataset. Through experiments, this paper shows that the proposed method can effectively embed the DNN-based object detector into an edge device equipped with Qualcomm’s QCS605 System-on-Chip (SoC), while achieving a real-time operation with more than 10 frames per second. MDPI 2023-04-06 /pmc/articles/PMC10099208/ /pubmed/37050837 http://dx.doi.org/10.3390/s23073777 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Choi, Kyoungtaek Wi, Seong Min Jung, Ho Gi Suhr, Jae Kyu Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing |
title | Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing |
title_full | Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing |
title_fullStr | Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing |
title_full_unstemmed | Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing |
title_short | Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing |
title_sort | simplification of deep neural network-based object detector for real-time edge computing |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099208/ https://www.ncbi.nlm.nih.gov/pubmed/37050837 http://dx.doi.org/10.3390/s23073777 |
work_keys_str_mv | AT choikyoungtaek simplificationofdeepneuralnetworkbasedobjectdetectorforrealtimeedgecomputing AT wiseongmin simplificationofdeepneuralnetworkbasedobjectdetectorforrealtimeedgecomputing AT junghogi simplificationofdeepneuralnetworkbasedobjectdetectorforrealtimeedgecomputing AT suhrjaekyu simplificationofdeepneuralnetworkbasedobjectdetectorforrealtimeedgecomputing |