Cargando…

Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach

The quality of videos varies due to the different capabilities of sensors. Video super-resolution (VSR) is a technology that improves the quality of captured video. However, the development of a VSR model is very costly. In this paper, we present a novel approach for adapting single-image super-reso...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Wenhao, Liu, Zhenbing, Lu, Haoxiang, Lan, Rushi, Huang, Yingxin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10255317/
https://www.ncbi.nlm.nih.gov/pubmed/37299757
http://dx.doi.org/10.3390/s23115030
_version_ 1785056841744515072
author Wang, Wenhao
Liu, Zhenbing
Lu, Haoxiang
Lan, Rushi
Huang, Yingxin
author_facet Wang, Wenhao
Liu, Zhenbing
Lu, Haoxiang
Lan, Rushi
Huang, Yingxin
author_sort Wang, Wenhao
collection PubMed
description The quality of videos varies due to the different capabilities of sensors. Video super-resolution (VSR) is a technology that improves the quality of captured video. However, the development of a VSR model is very costly. In this paper, we present a novel approach for adapting single-image super-resolution (SISR) models to the VSR task. To achieve this, we first summarize a common architecture of SISR models and perform a formal analysis of adaptation. Then, we propose an adaptation method that incorporates a plug-and-play temporal feature extraction module into existing SISR models. The proposed temporal feature extraction module consists of three submodules: offset estimation, spatial aggregation, and temporal aggregation. In the spatial aggregation submodule, the features obtained from the SISR model are aligned to the center frame based on the offset estimation results. The aligned features are fused in the temporal aggregation submodule. Finally, the fused temporal feature is fed to the SISR model for reconstruction. To evaluate the effectiveness of our method, we adapt five representative SISR models and evaluate these models on two popular benchmarks. The experiment results show the proposed method is effective on different SISR models. In particular, on the Vid4 benchmark, the VSR-adapted models achieve at least 1.26 dB and 0.067 improvement over the original SISR models in terms of PSNR and SSIM metrics, respectively. Additionally, these VSR-adapted models achieve better performance than the state-of-the-art VSR models.
format Online
Article
Text
id pubmed-10255317
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102553172023-06-10 Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach Wang, Wenhao Liu, Zhenbing Lu, Haoxiang Lan, Rushi Huang, Yingxin Sensors (Basel) Article The quality of videos varies due to the different capabilities of sensors. Video super-resolution (VSR) is a technology that improves the quality of captured video. However, the development of a VSR model is very costly. In this paper, we present a novel approach for adapting single-image super-resolution (SISR) models to the VSR task. To achieve this, we first summarize a common architecture of SISR models and perform a formal analysis of adaptation. Then, we propose an adaptation method that incorporates a plug-and-play temporal feature extraction module into existing SISR models. The proposed temporal feature extraction module consists of three submodules: offset estimation, spatial aggregation, and temporal aggregation. In the spatial aggregation submodule, the features obtained from the SISR model are aligned to the center frame based on the offset estimation results. The aligned features are fused in the temporal aggregation submodule. Finally, the fused temporal feature is fed to the SISR model for reconstruction. To evaluate the effectiveness of our method, we adapt five representative SISR models and evaluate these models on two popular benchmarks. The experiment results show the proposed method is effective on different SISR models. In particular, on the Vid4 benchmark, the VSR-adapted models achieve at least 1.26 dB and 0.067 improvement over the original SISR models in terms of PSNR and SSIM metrics, respectively. Additionally, these VSR-adapted models achieve better performance than the state-of-the-art VSR models. MDPI 2023-05-24 /pmc/articles/PMC10255317/ /pubmed/37299757 http://dx.doi.org/10.3390/s23115030 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Wenhao
Liu, Zhenbing
Lu, Haoxiang
Lan, Rushi
Huang, Yingxin
Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach
title Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach
title_full Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach
title_fullStr Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach
title_full_unstemmed Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach
title_short Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach
title_sort adapting single-image super-resolution models to video super-resolution: a plug-and-play approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10255317/
https://www.ncbi.nlm.nih.gov/pubmed/37299757
http://dx.doi.org/10.3390/s23115030
work_keys_str_mv AT wangwenhao adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach
AT liuzhenbing adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach
AT luhaoxiang adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach
AT lanrushi adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach
AT huangyingxin adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach