Monocular Depth Estimation: Lightweight Convolutional and Matrix Capsule Feature-Fusion Network
This paper reports a study that aims to solve the problem of the weak adaptability to angle transformation of current monocular depth estimation algorithms. These algorithms are based on convolutional neural networks (CNNs) but produce results lacking in estimation accuracy and robustness. The paper...
Main Authors: | Wang, Yinchu; Zhu, Haijiang |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI, 2022 |
Subjects: | Article |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9459913/ https://www.ncbi.nlm.nih.gov/pubmed/36080801 http://dx.doi.org/10.3390/s22176344 |
_version_ | 1784786621488431104 |
---|---|
author | Wang, Yinchu; Zhu, Haijiang |
author_facet | Wang, Yinchu; Zhu, Haijiang |
author_sort | Wang, Yinchu |
collection | PubMed |
description | This paper reports a study that aims to solve the problem of the weak adaptability to angle transformation of current monocular depth estimation algorithms. These algorithms are based on convolutional neural networks (CNNs) but produce results lacking in estimation accuracy and robustness. The paper proposes a lightweight network based on convolution and capsule feature fusion (CNNapsule). First, the paper introduces a fusion block module that integrates CNN features and matrix capsule features to improve the adaptability of the network to perspective transformations. The fusion and deconvolution features are fused through skip connections to generate a depth image. In addition, the corresponding loss function is designed according to the long-tail distribution, gradient similarity, and structural similarity of the datasets. Finally, the results are compared with the methods applied to the NYU Depth V2 and KITTI datasets and show that our proposed method has better accuracy on the C1 and C2 indices and a better visual effect than traditional methods and deep learning methods without transfer learning. The number of trainable parameters required by this method is 65% lower than that required by methods presented in the literature. The generalization of this method is verified via the comparative testing of the data collected from the internet and mobile phones. |
format | Online Article Text |
id | pubmed-9459913 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94599132022-09-10 Monocular Depth Estimation: Lightweight Convolutional and Matrix Capsule Feature-Fusion Network Wang, Yinchu Zhu, Haijiang Sensors (Basel) Article This paper reports a study that aims to solve the problem of the weak adaptability to angle transformation of current monocular depth estimation algorithms. These algorithms are based on convolutional neural networks (CNNs) but produce results lacking in estimation accuracy and robustness. The paper proposes a lightweight network based on convolution and capsule feature fusion (CNNapsule). First, the paper introduces a fusion block module that integrates CNN features and matrix capsule features to improve the adaptability of the network to perspective transformations. The fusion and deconvolution features are fused through skip connections to generate a depth image. In addition, the corresponding loss function is designed according to the long-tail distribution, gradient similarity, and structural similarity of the datasets. Finally, the results are compared with the methods applied to the NYU Depth V2 and KITTI datasets and show that our proposed method has better accuracy on the C1 and C2 indices and a better visual effect than traditional methods and deep learning methods without transfer learning. The number of trainable parameters required by this method is 65% lower than that required by methods presented in the literature. The generalization of this method is verified via the comparative testing of the data collected from the internet and mobile phones. MDPI 2022-08-23 /pmc/articles/PMC9459913/ /pubmed/36080801 http://dx.doi.org/10.3390/s22176344 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Yinchu Zhu, Haijiang Monocular Depth Estimation: Lightweight Convolutional and Matrix Capsule Feature-Fusion Network |
title | Monocular Depth Estimation: Lightweight Convolutional and Matrix Capsule Feature-Fusion Network |
title_full | Monocular Depth Estimation: Lightweight Convolutional and Matrix Capsule Feature-Fusion Network |
title_fullStr | Monocular Depth Estimation: Lightweight Convolutional and Matrix Capsule Feature-Fusion Network |
title_full_unstemmed | Monocular Depth Estimation: Lightweight Convolutional and Matrix Capsule Feature-Fusion Network |
title_short | Monocular Depth Estimation: Lightweight Convolutional and Matrix Capsule Feature-Fusion Network |
title_sort | monocular depth estimation: lightweight convolutional and matrix capsule feature-fusion network |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9459913/ https://www.ncbi.nlm.nih.gov/pubmed/36080801 http://dx.doi.org/10.3390/s22176344 |
work_keys_str_mv | AT wangyinchu monoculardepthestimationlightweightconvolutionalandmatrixcapsulefeaturefusionnetwork AT zhuhaijiang monoculardepthestimationlightweightconvolutionalandmatrixcapsulefeaturefusionnetwork |
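The description field above states that the network fuses CNN feature maps with matrix capsule features in a fusion block, and that the fused and deconvolution features are then combined through skip connections. The record contains no code, so the module below is only a minimal sketch of that fusion idea under stated assumptions: it assumes the capsule branch has already produced a dense feature tensor (e.g., flattened pose matrices plus activations), and it fuses by bilinear resizing, channel concatenation, and a 1×1 convolution. The class name, channel arguments, and layer choices are illustrative, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FusionBlock(nn.Module):
    """Illustrative CNN/capsule feature fusion (not the paper's exact module).

    Assumes the capsule branch yields a dense tensor of shape
    (N, capsule_channels, H', W') that can be resized to the CNN feature map.
    """

    def __init__(self, cnn_channels, capsule_channels, out_channels):
        super().__init__()
        # 1x1 convolution mixes the concatenated CNN and capsule channels
        self.mix = nn.Conv2d(cnn_channels + capsule_channels, out_channels, kernel_size=1)
        self.bn = nn.BatchNorm2d(out_channels)

    def forward(self, cnn_feat, capsule_feat):
        # Resize capsule features to the CNN feature resolution before fusing
        capsule_feat = F.interpolate(
            capsule_feat, size=cnn_feat.shape[2:], mode="bilinear", align_corners=False
        )
        fused = torch.cat([cnn_feat, capsule_feat], dim=1)
        return F.relu(self.bn(self.mix(fused)))


# Toy usage: fuse a 64-channel CNN map with a 32-channel capsule map
if __name__ == "__main__":
    block = FusionBlock(cnn_channels=64, capsule_channels=32, out_channels=64)
    cnn_feat = torch.randn(1, 64, 60, 80)
    capsule_feat = torch.randn(1, 32, 30, 40)
    print(block(cnn_feat, capsule_feat).shape)  # torch.Size([1, 64, 60, 80])
```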
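The description also says the loss function is designed according to the long-tail distribution of depth values, gradient similarity, and structural similarity. The sketch below shows one plausible way to combine such terms; the log-space depth term, the 3×3 SSIM window, and the weights alpha, beta, and gamma are assumptions for illustration, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def depth_loss(pred, target, alpha=1.0, beta=1.0, gamma=1.0, eps=1e-6):
    """Illustrative composite depth loss (not the paper's exact formulation).

    pred, target: (N, 1, H, W) depth maps with positive values.
    """
    # Log-space L1 term: compresses the long tail of large depth values
    l_depth = torch.mean(torch.abs(torch.log(pred + eps) - torch.log(target + eps)))

    # Gradient-similarity term: L1 on horizontal and vertical depth gradients
    dx_pred = pred[:, :, :, 1:] - pred[:, :, :, :-1]
    dx_tgt = target[:, :, :, 1:] - target[:, :, :, :-1]
    dy_pred = pred[:, :, 1:, :] - pred[:, :, :-1, :]
    dy_tgt = target[:, :, 1:, :] - target[:, :, :-1, :]
    l_grad = torch.mean(torch.abs(dx_pred - dx_tgt)) + torch.mean(torch.abs(dy_pred - dy_tgt))

    # Structural-similarity term: single-scale SSIM with a 3x3 box window
    mu_p = F.avg_pool2d(pred, 3, 1, 1)
    mu_t = F.avg_pool2d(target, 3, 1, 1)
    sigma_p = F.avg_pool2d(pred * pred, 3, 1, 1) - mu_p ** 2
    sigma_t = F.avg_pool2d(target * target, 3, 1, 1) - mu_t ** 2
    sigma_pt = F.avg_pool2d(pred * target, 3, 1, 1) - mu_p * mu_t
    c1, c2 = 0.01 ** 2, 0.03 ** 2
    ssim = ((2 * mu_p * mu_t + c1) * (2 * sigma_pt + c2)) / (
        (mu_p ** 2 + mu_t ** 2 + c1) * (sigma_p + sigma_t + c2)
    )
    l_ssim = torch.mean((1 - ssim) / 2)

    return alpha * l_depth + beta * l_grad + gamma * l_ssim
```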