Cargando…

MLWAN: Multi-Scale Learning Wavelet Attention Module Network for Image Super Resolution

Image super resolution (SR) is an important image processing technique in computer vision to improve the resolution of images and videos. In recent years, deep convolutional neural network (CNN) has made significant progress in the field of image SR; however, the existing CNN-based SR methods cannot...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Jian, Han, Xiyu, Zhang, Xiaoyin, Li, Zhipeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9741030/
https://www.ncbi.nlm.nih.gov/pubmed/36501811
http://dx.doi.org/10.3390/s22239110
_version_ 1784848215753883648
author Ma, Jian
Han, Xiyu
Zhang, Xiaoyin
Li, Zhipeng
author_facet Ma, Jian
Han, Xiyu
Zhang, Xiaoyin
Li, Zhipeng
author_sort Ma, Jian
collection PubMed
description Image super resolution (SR) is an important image processing technique in computer vision to improve the resolution of images and videos. In recent years, deep convolutional neural network (CNN) has made significant progress in the field of image SR; however, the existing CNN-based SR methods cannot fully search for background information in the measurement of feature extraction. In addition, in most cases, different scale factors of image SR are assumed to be different assignments and completed by training different models, which does not meet the actual application requirements. To solve these problems, we propose a multi-scale learning wavelet attention network (MLWAN) model for image SR. Specifically, the proposed model consists of three parts. In the first part, low-level features are extracted from the input image through two convolutional layers, and then a new channel-spatial attention mechanism (CSAM) block is concatenated. In the second part, CNN is used to predict the highest-level low-frequency wavelet coefficients, and the third part uses recursive neural networks (RNN) with different scales to predict the wavelet coefficients of the remaining subbands. In order to further achieve lightweight, an effective channel attention recurrent module (ECARM) is proposed to reduce network parameters. Finally, the inverse discrete wavelet transform (IDWT) is used to reconstruct HR image. Experimental results on public large-scale datasets demonstrate the superiority of the proposed model in terms of quantitative indicators and visual effects.
format Online
Article
Text
id pubmed-9741030
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97410302022-12-11 MLWAN: Multi-Scale Learning Wavelet Attention Module Network for Image Super Resolution Ma, Jian Han, Xiyu Zhang, Xiaoyin Li, Zhipeng Sensors (Basel) Article Image super resolution (SR) is an important image processing technique in computer vision to improve the resolution of images and videos. In recent years, deep convolutional neural network (CNN) has made significant progress in the field of image SR; however, the existing CNN-based SR methods cannot fully search for background information in the measurement of feature extraction. In addition, in most cases, different scale factors of image SR are assumed to be different assignments and completed by training different models, which does not meet the actual application requirements. To solve these problems, we propose a multi-scale learning wavelet attention network (MLWAN) model for image SR. Specifically, the proposed model consists of three parts. In the first part, low-level features are extracted from the input image through two convolutional layers, and then a new channel-spatial attention mechanism (CSAM) block is concatenated. In the second part, CNN is used to predict the highest-level low-frequency wavelet coefficients, and the third part uses recursive neural networks (RNN) with different scales to predict the wavelet coefficients of the remaining subbands. In order to further achieve lightweight, an effective channel attention recurrent module (ECARM) is proposed to reduce network parameters. Finally, the inverse discrete wavelet transform (IDWT) is used to reconstruct HR image. Experimental results on public large-scale datasets demonstrate the superiority of the proposed model in terms of quantitative indicators and visual effects. MDPI 2022-11-24 /pmc/articles/PMC9741030/ /pubmed/36501811 http://dx.doi.org/10.3390/s22239110 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ma, Jian
Han, Xiyu
Zhang, Xiaoyin
Li, Zhipeng
MLWAN: Multi-Scale Learning Wavelet Attention Module Network for Image Super Resolution
title MLWAN: Multi-Scale Learning Wavelet Attention Module Network for Image Super Resolution
title_full MLWAN: Multi-Scale Learning Wavelet Attention Module Network for Image Super Resolution
title_fullStr MLWAN: Multi-Scale Learning Wavelet Attention Module Network for Image Super Resolution
title_full_unstemmed MLWAN: Multi-Scale Learning Wavelet Attention Module Network for Image Super Resolution
title_short MLWAN: Multi-Scale Learning Wavelet Attention Module Network for Image Super Resolution
title_sort mlwan: multi-scale learning wavelet attention module network for image super resolution
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9741030/
https://www.ncbi.nlm.nih.gov/pubmed/36501811
http://dx.doi.org/10.3390/s22239110
work_keys_str_mv AT majian mlwanmultiscalelearningwaveletattentionmodulenetworkforimagesuperresolution
AT hanxiyu mlwanmultiscalelearningwaveletattentionmodulenetworkforimagesuperresolution
AT zhangxiaoyin mlwanmultiscalelearningwaveletattentionmodulenetworkforimagesuperresolution
AT lizhipeng mlwanmultiscalelearningwaveletattentionmodulenetworkforimagesuperresolution