Cargando…
Fully Cross-Attention Transformer for Guided Depth Super-Resolution
Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007518/ https://www.ncbi.nlm.nih.gov/pubmed/36904930 http://dx.doi.org/10.3390/s23052723 |
_version_ | 1784905541552701440 |
---|---|
author | Ariav, Ido Cohen, Israel |
author_facet | Ariav, Ido Cohen, Israel |
author_sort | Ariav, Ido |
collection | PubMed |
description | Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for guided super-resolution of depth maps. A guided super-resolution scheme uses a corresponding high-resolution color image to infer high-resolution depth maps from low-resolution ones. Unfortunately, these methods still have texture copying problems due to improper guidance from color images. Specifically, in most existing methods, guidance from the color image is achieved by a naive concatenation of color and depth features. In this paper, we propose a fully transformer-based network for depth map super-resolution. A cascaded transformer module extracts deep features from a low-resolution depth. It incorporates a novel cross-attention mechanism to seamlessly and continuously guide the color image into the depth upsampling process. Using a window partitioning scheme, linear complexity in image resolution can be achieved, so it can be applied to high-resolution images. The proposed method of guided depth super-resolution outperforms other state-of-the-art methods through extensive experiments. |
format | Online Article Text |
id | pubmed-10007518 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100075182023-03-12 Fully Cross-Attention Transformer for Guided Depth Super-Resolution Ariav, Ido Cohen, Israel Sensors (Basel) Article Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for guided super-resolution of depth maps. A guided super-resolution scheme uses a corresponding high-resolution color image to infer high-resolution depth maps from low-resolution ones. Unfortunately, these methods still have texture copying problems due to improper guidance from color images. Specifically, in most existing methods, guidance from the color image is achieved by a naive concatenation of color and depth features. In this paper, we propose a fully transformer-based network for depth map super-resolution. A cascaded transformer module extracts deep features from a low-resolution depth. It incorporates a novel cross-attention mechanism to seamlessly and continuously guide the color image into the depth upsampling process. Using a window partitioning scheme, linear complexity in image resolution can be achieved, so it can be applied to high-resolution images. The proposed method of guided depth super-resolution outperforms other state-of-the-art methods through extensive experiments. MDPI 2023-03-02 /pmc/articles/PMC10007518/ /pubmed/36904930 http://dx.doi.org/10.3390/s23052723 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ariav, Ido Cohen, Israel Fully Cross-Attention Transformer for Guided Depth Super-Resolution |
title | Fully Cross-Attention Transformer for Guided Depth Super-Resolution |
title_full | Fully Cross-Attention Transformer for Guided Depth Super-Resolution |
title_fullStr | Fully Cross-Attention Transformer for Guided Depth Super-Resolution |
title_full_unstemmed | Fully Cross-Attention Transformer for Guided Depth Super-Resolution |
title_short | Fully Cross-Attention Transformer for Guided Depth Super-Resolution |
title_sort | fully cross-attention transformer for guided depth super-resolution |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007518/ https://www.ncbi.nlm.nih.gov/pubmed/36904930 http://dx.doi.org/10.3390/s23052723 |
work_keys_str_mv | AT ariavido fullycrossattentiontransformerforguideddepthsuperresolution AT cohenisrael fullycrossattentiontransformerforguideddepthsuperresolution |