Cargando…

Fully Cross-Attention Transformer for Guided Depth Super-Resolution

Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for...

Descripción completa

Detalles Bibliográficos
Autores principales: Ariav, Ido, Cohen, Israel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007518/
https://www.ncbi.nlm.nih.gov/pubmed/36904930
http://dx.doi.org/10.3390/s23052723
_version_ 1784905541552701440
author Ariav, Ido
Cohen, Israel
author_facet Ariav, Ido
Cohen, Israel
author_sort Ariav, Ido
collection PubMed
description Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for guided super-resolution of depth maps. A guided super-resolution scheme uses a corresponding high-resolution color image to infer high-resolution depth maps from low-resolution ones. Unfortunately, these methods still have texture copying problems due to improper guidance from color images. Specifically, in most existing methods, guidance from the color image is achieved by a naive concatenation of color and depth features. In this paper, we propose a fully transformer-based network for depth map super-resolution. A cascaded transformer module extracts deep features from a low-resolution depth. It incorporates a novel cross-attention mechanism to seamlessly and continuously guide the color image into the depth upsampling process. Using a window partitioning scheme, linear complexity in image resolution can be achieved, so it can be applied to high-resolution images. The proposed method of guided depth super-resolution outperforms other state-of-the-art methods through extensive experiments.
format Online
Article
Text
id pubmed-10007518
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100075182023-03-12 Fully Cross-Attention Transformer for Guided Depth Super-Resolution Ariav, Ido Cohen, Israel Sensors (Basel) Article Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for guided super-resolution of depth maps. A guided super-resolution scheme uses a corresponding high-resolution color image to infer high-resolution depth maps from low-resolution ones. Unfortunately, these methods still have texture copying problems due to improper guidance from color images. Specifically, in most existing methods, guidance from the color image is achieved by a naive concatenation of color and depth features. In this paper, we propose a fully transformer-based network for depth map super-resolution. A cascaded transformer module extracts deep features from a low-resolution depth. It incorporates a novel cross-attention mechanism to seamlessly and continuously guide the color image into the depth upsampling process. Using a window partitioning scheme, linear complexity in image resolution can be achieved, so it can be applied to high-resolution images. The proposed method of guided depth super-resolution outperforms other state-of-the-art methods through extensive experiments. MDPI 2023-03-02 /pmc/articles/PMC10007518/ /pubmed/36904930 http://dx.doi.org/10.3390/s23052723 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ariav, Ido
Cohen, Israel
Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_full Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_fullStr Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_full_unstemmed Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_short Fully Cross-Attention Transformer for Guided Depth Super-Resolution
title_sort fully cross-attention transformer for guided depth super-resolution
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007518/
https://www.ncbi.nlm.nih.gov/pubmed/36904930
http://dx.doi.org/10.3390/s23052723
work_keys_str_mv AT ariavido fullycrossattentiontransformerforguideddepthsuperresolution
AT cohenisrael fullycrossattentiontransformerforguideddepthsuperresolution