Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme
The demand for object detection capability in edge computing systems has surged. As such, the need for lightweight Convolutional Neural Network (CNN)-based object detection models has become a focal point. Current models are large in memory and deployment in edge devices is demanding. This shows tha...
Main Authors: | Ademola, Olutosin Ajibola; Leier, Mairo; Petlenkov, Eduard |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI, 2021 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8622199/ ; https://www.ncbi.nlm.nih.gov/pubmed/34833610 ; http://dx.doi.org/10.3390/s21227529 |
_version_ | 1784605638636077056 |
---|---|
author | Ademola, Olutosin Ajibola; Leier, Mairo; Petlenkov, Eduard |
author_facet | Ademola, Olutosin Ajibola; Leier, Mairo; Petlenkov, Eduard |
author_sort | Ademola, Olutosin Ajibola |
collection | PubMed |
description | The demand for object detection capability in edge computing systems has surged. As such, the need for lightweight Convolutional Neural Network (CNN)-based object detection models has become a focal point. Current models are large in memory and deployment in edge devices is demanding. This shows that the models need to be optimized for the hardware without performance degradation. There exist several model compression methods; however, determining the most efficient method is of major concern. Our goal was to rank the performance of these methods using our application as a case study. We aimed to develop a real-time vehicle tracking system for cargo ships. To address this, we developed a weighted score-based ranking scheme that utilizes the model performance metrics. We demonstrated the effectiveness of this method by applying it on the baseline, compressed, and micro-CNN models trained on our dataset. The result showed that quantization is the most efficient compression method for the application, having the highest rank, with an average weighted score of 9.00, followed by binarization, having an average weighted score of 8.07. Our proposed method is extendable and can be used as a framework for the selection of suitable model compression methods for edge devices in different applications. |
format | Online Article Text |
id | pubmed-8622199 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-86221992021-11-27 Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme Ademola, Olutosin Ajibola Leier, Mairo Petlenkov, Eduard Sensors (Basel) Article The demand for object detection capability in edge computing systems has surged. As such, the need for lightweight Convolutional Neural Network (CNN)-based object detection models has become a focal point. Current models are large in memory and deployment in edge devices is demanding. This shows that the models need to be optimized for the hardware without performance degradation. There exist several model compression methods; however, determining the most efficient method is of major concern. Our goal was to rank the performance of these methods using our application as a case study. We aimed to develop a real-time vehicle tracking system for cargo ships. To address this, we developed a weighted score-based ranking scheme that utilizes the model performance metrics. We demonstrated the effectiveness of this method by applying it on the baseline, compressed, and micro-CNN models trained on our dataset. The result showed that quantization is the most efficient compression method for the application, having the highest rank, with an average weighted score of 9.00, followed by binarization, having an average weighted score of 8.07. Our proposed method is extendable and can be used as a framework for the selection of suitable model compression methods for edge devices in different applications. MDPI 2021-11-12 /pmc/articles/PMC8622199/ /pubmed/34833610 http://dx.doi.org/10.3390/s21227529 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ademola, Olutosin Ajibola Leier, Mairo Petlenkov, Eduard Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme |
title | Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8622199/ https://www.ncbi.nlm.nih.gov/pubmed/34833610 http://dx.doi.org/10.3390/s21227529 |
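
The description above mentions a weighted score-based ranking scheme built on model performance metrics, but this record does not give the scoring formula. The following is only a minimal sketch of one way such a scheme could work, assuming min-max normalization per metric and a weighted sum rescaled to 0-10. The function names, metric names, weights, and all numeric values are hypothetical placeholders for illustration; they are not taken from the article and do not reproduce its results.

```python
from typing import Dict, List


def weighted_scores(
    metrics: Dict[str, Dict[str, float]],
    weights: Dict[str, float],
    higher_is_better: Dict[str, bool],
) -> Dict[str, float]:
    """Return one weighted score per model on a 0-10 scale (assumed scale)."""
    models = list(metrics)
    totals = {m: 0.0 for m in models}
    weight_sum = sum(weights.values())
    for name, w in weights.items():
        values = [metrics[m][name] for m in models]
        lo, hi = min(values), max(values)
        for m in models:
            if hi == lo:
                norm = 1.0  # every model ties on this metric
            else:
                norm = (metrics[m][name] - lo) / (hi - lo)
                if not higher_is_better[name]:
                    norm = 1.0 - norm  # smaller raw value earns a higher score
            totals[m] += w * norm
    return {m: 10.0 * totals[m] / weight_sum for m in models}


def rank(scores: Dict[str, float]) -> List[str]:
    """Order model names from highest to lowest weighted score."""
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical placeholder measurements, not data from the article.
metrics = {
    "model_a": {"accuracy_pct": 90, "size_mb": 100, "latency_ms": 50},
    "model_b": {"accuracy_pct": 88, "size_mb": 25, "latency_ms": 20},
    "model_c": {"accuracy_pct": 80, "size_mb": 10, "latency_ms": 30},
}
# Hypothetical weights expressing how much each metric matters.
weights = {"accuracy_pct": 0.5, "size_mb": 0.25, "latency_ms": 0.25}
higher_is_better = {"accuracy_pct": True, "size_mb": False, "latency_ms": False}

scores = weighted_scores(metrics, weights, higher_is_better)
ranking = rank(scores)
print(ranking)
```

In this sketch the weights encode the priorities of the target application, so changing them can change which model ranks first; that is the sense in which a scheme of this kind could be reused across applications.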