Cargando…

Fully Cross-Attention Transformer for Guided Depth Super-Resolution

Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ariav, Ido, Cohen, Israel
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007518/ https://www.ncbi.nlm.nih.gov/pubmed/36904930 http://dx.doi.org/10.3390/s23052723

_version_	1784905541552701440
author	Ariav, Ido Cohen, Israel
author_facet	Ariav, Ido Cohen, Israel
author_sort	Ariav, Ido
collection	PubMed
description	Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for guided super-resolution of depth maps. A guided super-resolution scheme uses a corresponding high-resolution color image to infer high-resolution depth maps from low-resolution ones. Unfortunately, these methods still have texture copying problems due to improper guidance from color images. Specifically, in most existing methods, guidance from the color image is achieved by a naive concatenation of color and depth features. In this paper, we propose a fully transformer-based network for depth map super-resolution. A cascaded transformer module extracts deep features from a low-resolution depth. It incorporates a novel cross-attention mechanism to seamlessly and continuously guide the color image into the depth upsampling process. Using a window partitioning scheme, linear complexity in image resolution can be achieved, so it can be applied to high-resolution images. The proposed method of guided depth super-resolution outperforms other state-of-the-art methods through extensive experiments.
format	Online Article Text
id	pubmed-10007518
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-100075182023-03-12 Fully Cross-Attention Transformer for Guided Depth Super-Resolution Ariav, Ido Cohen, Israel Sensors (Basel) Article Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for guided super-resolution of depth maps. A guided super-resolution scheme uses a corresponding high-resolution color image to infer high-resolution depth maps from low-resolution ones. Unfortunately, these methods still have texture copying problems due to improper guidance from color images. Specifically, in most existing methods, guidance from the color image is achieved by a naive concatenation of color and depth features. In this paper, we propose a fully transformer-based network for depth map super-resolution. A cascaded transformer module extracts deep features from a low-resolution depth. It incorporates a novel cross-attention mechanism to seamlessly and continuously guide the color image into the depth upsampling process. Using a window partitioning scheme, linear complexity in image resolution can be achieved, so it can be applied to high-resolution images. The proposed method of guided depth super-resolution outperforms other state-of-the-art methods through extensive experiments. MDPI 2023-03-02 /pmc/articles/PMC10007518/ /pubmed/36904930 http://dx.doi.org/10.3390/s23052723 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Ariav, Ido Cohen, Israel Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title	Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_full	Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_fullStr	Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_full_unstemmed	Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_short	Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_sort	fully cross-attention transformer for guided depth super-resolution
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007518/ https://www.ncbi.nlm.nih.gov/pubmed/36904930 http://dx.doi.org/10.3390/s23052723
work_keys_str_mv	AT ariavido fullycrossattentiontransformerforguideddepthsuperresolution AT cohenisrael fullycrossattentiontransformerforguideddepthsuperresolution

Fully Cross-Attention Transformer for Guided Depth Super-Resolution

Ejemplares similares