Cargando…

DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation

We propose Depth-to-Space Net (DTS-Net), an effective technique for semantic segmentation using the efficient sub-pixel convolutional neural network. This technique is inspired by depth-to-space (DTS) image reconstruction, which was originally used for image and video super-resolution tasks, combine...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ibrahem, Hatem, Salem, Ahmed, Kang, Hyun-Soo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8749585/ https://www.ncbi.nlm.nih.gov/pubmed/35009879 http://dx.doi.org/10.3390/s22010337

_version_	1784631265392066560
author	Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo
author_facet	Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo
author_sort	Ibrahem, Hatem
collection	PubMed
description	We propose Depth-to-Space Net (DTS-Net), an effective technique for semantic segmentation using the efficient sub-pixel convolutional neural network. This technique is inspired by depth-to-space (DTS) image reconstruction, which was originally used for image and video super-resolution tasks, combined with a mask enhancement filtration technique based on multi-label classification, namely, Nearest Label Filtration. In the proposed technique, we employ depth-wise separable convolution-based architectures. We propose both a deep network, that is, DTS-Net, and a lightweight network, DTS-Net-Lite, for real-time semantic segmentation; these networks employ Xception and MobileNetV2 architectures as the feature extractors, respectively. In addition, we explore the joint semantic segmentation and depth estimation task and demonstrate that the proposed technique can efficiently perform both tasks simultaneously, outperforming state-of-art (SOTA) methods. We train and evaluate the performance of the proposed method on the PASCAL VOC2012, NYUV2, and CITYSCAPES benchmarks. Hence, we obtain high mean intersection over union (mIOU) and mean pixel accuracy (Pix.acc.) values using simple and lightweight convolutional neural network architectures of the developed networks. Notably, the proposed method outperforms SOTA methods that depend on encoder–decoder architectures, although our implementation and computations are far simpler.
format	Online Article Text
id	pubmed-8749585
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-87495852022-01-12 DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo Sensors (Basel) Article We propose Depth-to-Space Net (DTS-Net), an effective technique for semantic segmentation using the efficient sub-pixel convolutional neural network. This technique is inspired by depth-to-space (DTS) image reconstruction, which was originally used for image and video super-resolution tasks, combined with a mask enhancement filtration technique based on multi-label classification, namely, Nearest Label Filtration. In the proposed technique, we employ depth-wise separable convolution-based architectures. We propose both a deep network, that is, DTS-Net, and a lightweight network, DTS-Net-Lite, for real-time semantic segmentation; these networks employ Xception and MobileNetV2 architectures as the feature extractors, respectively. In addition, we explore the joint semantic segmentation and depth estimation task and demonstrate that the proposed technique can efficiently perform both tasks simultaneously, outperforming state-of-art (SOTA) methods. We train and evaluate the performance of the proposed method on the PASCAL VOC2012, NYUV2, and CITYSCAPES benchmarks. Hence, we obtain high mean intersection over union (mIOU) and mean pixel accuracy (Pix.acc.) values using simple and lightweight convolutional neural network architectures of the developed networks. Notably, the proposed method outperforms SOTA methods that depend on encoder–decoder architectures, although our implementation and computations are far simpler. MDPI 2022-01-03 /pmc/articles/PMC8749585/ /pubmed/35009879 http://dx.doi.org/10.3390/s22010337 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation
title	DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation
title_full	DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation
title_fullStr	DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation
title_full_unstemmed	DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation
title_short	DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation
title_sort	dts-net: depth-to-space networks for fast and accurate semantic object segmentation
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8749585/ https://www.ncbi.nlm.nih.gov/pubmed/35009879 http://dx.doi.org/10.3390/s22010337
work_keys_str_mv	AT ibrahemhatem dtsnetdepthtospacenetworksforfastandaccuratesemanticobjectsegmentation AT salemahmed dtsnetdepthtospacenetworksforfastandaccuratesemanticobjectsegmentation AT kanghyunsoo dtsnetdepthtospacenetworksforfastandaccuratesemanticobjectsegmentation

DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation

Ejemplares similares