Cargando…

DTS-Depth: Real-Time Single-Image Depth Estimation Using Depth-to-Space Image Construction

As most of the recent high-resolution depth-estimation algorithms are computationally so expensive that they cannot work in real time, the common solution is using a low-resolution input image to reduce the computational complexity. We propose a different approach, an efficient and real-time convolu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ibrahem, Hatem, Salem, Ahmed, Kang, Hyun-Soo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8914965/ https://www.ncbi.nlm.nih.gov/pubmed/35271061 http://dx.doi.org/10.3390/s22051914

_version_	1784667888844537856
author	Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo
author_facet	Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo
author_sort	Ibrahem, Hatem
collection	PubMed
description	As most of the recent high-resolution depth-estimation algorithms are computationally so expensive that they cannot work in real time, the common solution is using a low-resolution input image to reduce the computational complexity. We propose a different approach, an efficient and real-time convolutional neural network-based depth-estimation algorithm using a single high-resolution image as the input. The proposed method efficiently constructs a high-resolution depth map using a small encoding architecture and eliminates the need for a decoder, which is typically used in the encoder–decoder architectures employed for depth estimation. The proposed algorithm adopts a modified MobileNetV2 architecture, which is a lightweight architecture, to estimate the depth information through the depth-to-space image construction, which is generally employed in image super-resolution. As a result, it realizes fast frame processing and can predict a high-accuracy depth in real time. We train and test our method on the challenging KITTI, Cityscapes, and NYUV2 depth datasets. The proposed method achieves low relative absolute error (0.028 for KITTI, 0.167 for CITYSCAPES, and 0.069 for NYUV2) while working at speed reaching 48 frames per second on a GPU and 20 frames per second on a CPU for high-resolution test images. We compare our method with the state-of-the-art methods on depth estimation, showing that our method outperforms those methods. However, the architecture is less complex and works in real time.
format	Online Article Text
id	pubmed-8914965
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-89149652022-03-12 DTS-Depth: Real-Time Single-Image Depth Estimation Using Depth-to-Space Image Construction Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo Sensors (Basel) Article As most of the recent high-resolution depth-estimation algorithms are computationally so expensive that they cannot work in real time, the common solution is using a low-resolution input image to reduce the computational complexity. We propose a different approach, an efficient and real-time convolutional neural network-based depth-estimation algorithm using a single high-resolution image as the input. The proposed method efficiently constructs a high-resolution depth map using a small encoding architecture and eliminates the need for a decoder, which is typically used in the encoder–decoder architectures employed for depth estimation. The proposed algorithm adopts a modified MobileNetV2 architecture, which is a lightweight architecture, to estimate the depth information through the depth-to-space image construction, which is generally employed in image super-resolution. As a result, it realizes fast frame processing and can predict a high-accuracy depth in real time. We train and test our method on the challenging KITTI, Cityscapes, and NYUV2 depth datasets. The proposed method achieves low relative absolute error (0.028 for KITTI, 0.167 for CITYSCAPES, and 0.069 for NYUV2) while working at speed reaching 48 frames per second on a GPU and 20 frames per second on a CPU for high-resolution test images. We compare our method with the state-of-the-art methods on depth estimation, showing that our method outperforms those methods. However, the architecture is less complex and works in real time. MDPI 2022-03-01 /pmc/articles/PMC8914965/ /pubmed/35271061 http://dx.doi.org/10.3390/s22051914 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo DTS-Depth: Real-Time Single-Image Depth Estimation Using Depth-to-Space Image Construction
title	DTS-Depth: Real-Time Single-Image Depth Estimation Using Depth-to-Space Image Construction
title_full	DTS-Depth: Real-Time Single-Image Depth Estimation Using Depth-to-Space Image Construction
title_fullStr	DTS-Depth: Real-Time Single-Image Depth Estimation Using Depth-to-Space Image Construction
title_full_unstemmed	DTS-Depth: Real-Time Single-Image Depth Estimation Using Depth-to-Space Image Construction
title_short	DTS-Depth: Real-Time Single-Image Depth Estimation Using Depth-to-Space Image Construction
title_sort	dts-depth: real-time single-image depth estimation using depth-to-space image construction
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8914965/ https://www.ncbi.nlm.nih.gov/pubmed/35271061 http://dx.doi.org/10.3390/s22051914
work_keys_str_mv	AT ibrahemhatem dtsdepthrealtimesingleimagedepthestimationusingdepthtospaceimageconstruction AT salemahmed dtsdepthrealtimesingleimagedepthestimationusingdepthtospaceimageconstruction AT kanghyunsoo dtsdepthrealtimesingleimagedepthestimationusingdepthtospaceimageconstruction

DTS-Depth: Real-Time Single-Image Depth Estimation Using Depth-to-Space Image Construction

Ejemplares similares