Auto-Tiler: Variable-Dimension Autoencoder with Tiling for Compressing Intermediate Feature Space of Deep Neural Networks for Internet of Things

Due to limited resources of the Internet of Things (IoT) edge devices, deep neural network (DNN) inference requires collaboration with cloud server platforms, where DNN inference is partitioned and offloaded to high-performance servers to reduce end-to-end latency. As data-intensive intermediate fea...

Bibliographic Details
Main authors:	Park, Jeongsoo, Kim, Jungrae, Ko, Jong Hwan
Format:	Online Article Text
Language:	English
Published:	MDPI 2021
Subjects:	Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7866250/ https://www.ncbi.nlm.nih.gov/pubmed/33572784 http://dx.doi.org/10.3390/s21030896

_version_	1783648037983223808
author	Park, Jeongsoo Kim, Jungrae Ko, Jong Hwan
author_facet	Park, Jeongsoo Kim, Jungrae Ko, Jong Hwan
author_sort	Park, Jeongsoo
collection	PubMed
description	Due to limited resources of the Internet of Things (IoT) edge devices, deep neural network (DNN) inference requires collaboration with cloud server platforms, where DNN inference is partitioned and offloaded to high-performance servers to reduce end-to-end latency. As data-intensive intermediate feature space at the partitioned layer should be transmitted to the servers, efficient compression of the feature space is imperative for high-throughput inference. However, the feature space at deeper layers has different characteristics than natural images, limiting the compression performance by conventional preprocessing and encoding techniques. To tackle this limitation, we introduce a new method for compressing DNN intermediate feature space using a specialized autoencoder, called auto-tiler. The proposed auto-tiler is designed to include the tiling process and provide multiple input/output dimensions to support various partitioned layers and compression ratios. The results show that auto-tiler achieves 18% to 67% higher percent point accuracy compared to the existing methods at the same bitrate while reducing the process latency by 73% to 81%. The dimension variability of an auto-tiler also reduces the storage overhead by 62% with negligible accuracy loss.
format	Online Article Text
id	pubmed-7866250
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-78662502021-02-07 Auto-Tiler: Variable-Dimension Autoencoder with Tiling for Compressing Intermediate Feature Space of Deep Neural Networks for Internet of Things Park, Jeongsoo Kim, Jungrae Ko, Jong Hwan Sensors (Basel) Article Due to limited resources of the Internet of Things (IoT) edge devices, deep neural network (DNN) inference requires collaboration with cloud server platforms, where DNN inference is partitioned and offloaded to high-performance servers to reduce end-to-end latency. As data-intensive intermediate feature space at the partitioned layer should be transmitted to the servers, efficient compression of the feature space is imperative for high-throughput inference. However, the feature space at deeper layers has different characteristics than natural images, limiting the compression performance by conventional preprocessing and encoding techniques. To tackle this limitation, we introduce a new method for compressing DNN intermediate feature space using a specialized autoencoder, called auto-tiler. The proposed auto-tiler is designed to include the tiling process and provide multiple input/output dimensions to support various partitioned layers and compression ratios. The results show that auto-tiler achieves 18% to 67% higher percent point accuracy compared to the existing methods at the same bitrate while reducing the process latency by 73% to 81%. The dimension variability of an auto-tiler also reduces the storage overhead by 62% with negligible accuracy loss. MDPI 2021-01-29 /pmc/articles/PMC7866250/ /pubmed/33572784 http://dx.doi.org/10.3390/s21030896 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title	Auto-Tiler: Variable-Dimension Autoencoder with Tiling for Compressing Intermediate Feature Space of Deep Neural Networks for Internet of Things
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7866250/ https://www.ncbi.nlm.nih.gov/pubmed/33572784 http://dx.doi.org/10.3390/s21030896

Similar Items