Cargando…

Nested DWT–Based CNN Architecture for Monocular Depth Estimation

Applications such as medical diagnosis, navigation, robotics, etc., require 3D images. Recently, deep learning networks have been extensively applied to estimate depth. Depth prediction from 2D images poses a problem that is both ill–posed and non–linear. Such networks are computationally and time–w...

Descripción completa

Detalles Bibliográficos
Autores principales: Paul, Sandip, Mishra, Deepak, Marimuthu, Senthil Kumar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10052892/
https://www.ncbi.nlm.nih.gov/pubmed/36991780
http://dx.doi.org/10.3390/s23063066
_version_ 1785015262446092288
author Paul, Sandip
Mishra, Deepak
Marimuthu, Senthil Kumar
author_facet Paul, Sandip
Mishra, Deepak
Marimuthu, Senthil Kumar
author_sort Paul, Sandip
collection PubMed
description Applications such as medical diagnosis, navigation, robotics, etc., require 3D images. Recently, deep learning networks have been extensively applied to estimate depth. Depth prediction from 2D images poses a problem that is both ill–posed and non–linear. Such networks are computationally and time–wise expensive as they have dense configurations. Further, the network performance depends on the trained model configuration, the loss functions used, and the dataset applied for training. We propose a moderately dense encoder–decoder network based on discrete wavelet decomposition and trainable coefficients (LL, LH, HL, HH). Our Nested Wavelet–Net (NDWTN) preserves the high–frequency information that is otherwise lost during the downsampling process in the encoder. Furthermore, we study the effect of activation functions, batch normalization, convolution layers, skip, etc., in our models. The network is trained with NYU datasets. Our network trains faster with good results.
format Online
Article
Text
id pubmed-10052892
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100528922023-03-30 Nested DWT–Based CNN Architecture for Monocular Depth Estimation Paul, Sandip Mishra, Deepak Marimuthu, Senthil Kumar Sensors (Basel) Article Applications such as medical diagnosis, navigation, robotics, etc., require 3D images. Recently, deep learning networks have been extensively applied to estimate depth. Depth prediction from 2D images poses a problem that is both ill–posed and non–linear. Such networks are computationally and time–wise expensive as they have dense configurations. Further, the network performance depends on the trained model configuration, the loss functions used, and the dataset applied for training. We propose a moderately dense encoder–decoder network based on discrete wavelet decomposition and trainable coefficients (LL, LH, HL, HH). Our Nested Wavelet–Net (NDWTN) preserves the high–frequency information that is otherwise lost during the downsampling process in the encoder. Furthermore, we study the effect of activation functions, batch normalization, convolution layers, skip, etc., in our models. The network is trained with NYU datasets. Our network trains faster with good results. MDPI 2023-03-13 /pmc/articles/PMC10052892/ /pubmed/36991780 http://dx.doi.org/10.3390/s23063066 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Paul, Sandip
Mishra, Deepak
Marimuthu, Senthil Kumar
Nested DWT–Based CNN Architecture for Monocular Depth Estimation
title Nested DWT–Based CNN Architecture for Monocular Depth Estimation
title_full Nested DWT–Based CNN Architecture for Monocular Depth Estimation
title_fullStr Nested DWT–Based CNN Architecture for Monocular Depth Estimation
title_full_unstemmed Nested DWT–Based CNN Architecture for Monocular Depth Estimation
title_short Nested DWT–Based CNN Architecture for Monocular Depth Estimation
title_sort nested dwt–based cnn architecture for monocular depth estimation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10052892/
https://www.ncbi.nlm.nih.gov/pubmed/36991780
http://dx.doi.org/10.3390/s23063066
work_keys_str_mv AT paulsandip nesteddwtbasedcnnarchitectureformonoculardepthestimation
AT mishradeepak nesteddwtbasedcnnarchitectureformonoculardepthestimation
AT marimuthusenthilkumar nesteddwtbasedcnnarchitectureformonoculardepthestimation