Cargando…
Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach
The quality of videos varies due to the different capabilities of sensors. Video super-resolution (VSR) is a technology that improves the quality of captured video. However, the development of a VSR model is very costly. In this paper, we present a novel approach for adapting single-image super-reso...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10255317/ https://www.ncbi.nlm.nih.gov/pubmed/37299757 http://dx.doi.org/10.3390/s23115030 |
_version_ | 1785056841744515072 |
---|---|
author | Wang, Wenhao Liu, Zhenbing Lu, Haoxiang Lan, Rushi Huang, Yingxin |
author_facet | Wang, Wenhao Liu, Zhenbing Lu, Haoxiang Lan, Rushi Huang, Yingxin |
author_sort | Wang, Wenhao |
collection | PubMed |
description | The quality of videos varies due to the different capabilities of sensors. Video super-resolution (VSR) is a technology that improves the quality of captured video. However, the development of a VSR model is very costly. In this paper, we present a novel approach for adapting single-image super-resolution (SISR) models to the VSR task. To achieve this, we first summarize a common architecture of SISR models and perform a formal analysis of adaptation. Then, we propose an adaptation method that incorporates a plug-and-play temporal feature extraction module into existing SISR models. The proposed temporal feature extraction module consists of three submodules: offset estimation, spatial aggregation, and temporal aggregation. In the spatial aggregation submodule, the features obtained from the SISR model are aligned to the center frame based on the offset estimation results. The aligned features are fused in the temporal aggregation submodule. Finally, the fused temporal feature is fed to the SISR model for reconstruction. To evaluate the effectiveness of our method, we adapt five representative SISR models and evaluate these models on two popular benchmarks. The experiment results show the proposed method is effective on different SISR models. In particular, on the Vid4 benchmark, the VSR-adapted models achieve at least 1.26 dB and 0.067 improvement over the original SISR models in terms of PSNR and SSIM metrics, respectively. Additionally, these VSR-adapted models achieve better performance than the state-of-the-art VSR models. |
format | Online Article Text |
id | pubmed-10255317 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102553172023-06-10 Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach Wang, Wenhao Liu, Zhenbing Lu, Haoxiang Lan, Rushi Huang, Yingxin Sensors (Basel) Article The quality of videos varies due to the different capabilities of sensors. Video super-resolution (VSR) is a technology that improves the quality of captured video. However, the development of a VSR model is very costly. In this paper, we present a novel approach for adapting single-image super-resolution (SISR) models to the VSR task. To achieve this, we first summarize a common architecture of SISR models and perform a formal analysis of adaptation. Then, we propose an adaptation method that incorporates a plug-and-play temporal feature extraction module into existing SISR models. The proposed temporal feature extraction module consists of three submodules: offset estimation, spatial aggregation, and temporal aggregation. In the spatial aggregation submodule, the features obtained from the SISR model are aligned to the center frame based on the offset estimation results. The aligned features are fused in the temporal aggregation submodule. Finally, the fused temporal feature is fed to the SISR model for reconstruction. To evaluate the effectiveness of our method, we adapt five representative SISR models and evaluate these models on two popular benchmarks. The experiment results show the proposed method is effective on different SISR models. In particular, on the Vid4 benchmark, the VSR-adapted models achieve at least 1.26 dB and 0.067 improvement over the original SISR models in terms of PSNR and SSIM metrics, respectively. Additionally, these VSR-adapted models achieve better performance than the state-of-the-art VSR models. MDPI 2023-05-24 /pmc/articles/PMC10255317/ /pubmed/37299757 http://dx.doi.org/10.3390/s23115030 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Wenhao Liu, Zhenbing Lu, Haoxiang Lan, Rushi Huang, Yingxin Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach |
title | Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach |
title_full | Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach |
title_fullStr | Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach |
title_full_unstemmed | Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach |
title_short | Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach |
title_sort | adapting single-image super-resolution models to video super-resolution: a plug-and-play approach |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10255317/ https://www.ncbi.nlm.nih.gov/pubmed/37299757 http://dx.doi.org/10.3390/s23115030 |
work_keys_str_mv | AT wangwenhao adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach AT liuzhenbing adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach AT luhaoxiang adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach AT lanrushi adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach AT huangyingxin adaptingsingleimagesuperresolutionmodelstovideosuperresolutionaplugandplayapproach |