Cargando…

Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme

The demand for object detection capability in edge computing systems has surged. As such, the need for lightweight Convolutional Neural Network (CNN)-based object detection models has become a focal point. Current models are large in memory and deployment in edge devices is demanding. This shows tha...

Descripción completa

Detalles Bibliográficos
Autores principales: Ademola, Olutosin Ajibola, Leier, Mairo, Petlenkov, Eduard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8622199/
https://www.ncbi.nlm.nih.gov/pubmed/34833610
http://dx.doi.org/10.3390/s21227529
_version_ 1784605638636077056
author Ademola, Olutosin Ajibola
Leier, Mairo
Petlenkov, Eduard
author_facet Ademola, Olutosin Ajibola
Leier, Mairo
Petlenkov, Eduard
author_sort Ademola, Olutosin Ajibola
collection PubMed
description The demand for object detection capability in edge computing systems has surged. As such, the need for lightweight Convolutional Neural Network (CNN)-based object detection models has become a focal point. Current models are large in memory and deployment in edge devices is demanding. This shows that the models need to be optimized for the hardware without performance degradation. There exist several model compression methods; however, determining the most efficient method is of major concern. Our goal was to rank the performance of these methods using our application as a case study. We aimed to develop a real-time vehicle tracking system for cargo ships. To address this, we developed a weighted score-based ranking scheme that utilizes the model performance metrics. We demonstrated the effectiveness of this method by applying it on the baseline, compressed, and micro-CNN models trained on our dataset. The result showed that quantization is the most efficient compression method for the application, having the highest rank, with an average weighted score of 9.00, followed by binarization, having an average weighted score of 8.07. Our proposed method is extendable and can be used as a framework for the selection of suitable model compression methods for edge devices in different applications.
format Online
Article
Text
id pubmed-8622199
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86221992021-11-27 Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme Ademola, Olutosin Ajibola Leier, Mairo Petlenkov, Eduard Sensors (Basel) Article The demand for object detection capability in edge computing systems has surged. As such, the need for lightweight Convolutional Neural Network (CNN)-based object detection models has become a focal point. Current models are large in memory and deployment in edge devices is demanding. This shows that the models need to be optimized for the hardware without performance degradation. There exist several model compression methods; however, determining the most efficient method is of major concern. Our goal was to rank the performance of these methods using our application as a case study. We aimed to develop a real-time vehicle tracking system for cargo ships. To address this, we developed a weighted score-based ranking scheme that utilizes the model performance metrics. We demonstrated the effectiveness of this method by applying it on the baseline, compressed, and micro-CNN models trained on our dataset. The result showed that quantization is the most efficient compression method for the application, having the highest rank, with an average weighted score of 9.00, followed by binarization, having an average weighted score of 8.07. Our proposed method is extendable and can be used as a framework for the selection of suitable model compression methods for edge devices in different applications. MDPI 2021-11-12 /pmc/articles/PMC8622199/ /pubmed/34833610 http://dx.doi.org/10.3390/s21227529 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ademola, Olutosin Ajibola
Leier, Mairo
Petlenkov, Eduard
Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme
title Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme
title_full Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme
title_fullStr Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme
title_full_unstemmed Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme
title_short Evaluation of Deep Neural Network Compression Methods for Edge Devices Using Weighted Score-Based Ranking Scheme
title_sort evaluation of deep neural network compression methods for edge devices using weighted score-based ranking scheme
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8622199/
https://www.ncbi.nlm.nih.gov/pubmed/34833610
http://dx.doi.org/10.3390/s21227529
work_keys_str_mv AT ademolaolutosinajibola evaluationofdeepneuralnetworkcompressionmethodsforedgedevicesusingweightedscorebasedrankingscheme
AT leiermairo evaluationofdeepneuralnetworkcompressionmethodsforedgedevicesusingweightedscorebasedrankingscheme
AT petlenkoveduard evaluationofdeepneuralnetworkcompressionmethodsforedgedevicesusingweightedscorebasedrankingscheme