Mitigating Artifacts in Real-World Video Super-resolution Models

Authors

  • Liangbin Xie Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China
  • Xintao Wang Tencent
  • Shuwei Shi Tsinghua University
  • Jinjin Gu The University of Sydney
  • Chao Dong SIAT
  • Ying Shan Tencent

DOI:

https://doi.org/10.1609/aaai.v37i3.25398

Keywords:

CV: Low Level & Physics-Based Vision

Abstract

The recurrent structure is a prevalent framework for the task of video super-resolution, which models the temporal dependency between frames via hidden states. When applied to real-world scenarios with unknown and complex degradations, hidden states tend to contain unpleasant artifacts and propagate them to restored frames. In this circumstance, our analyses show that such artifacts can be largely alleviated when the hidden state is replaced with a cleaner counterpart. Based on the observations, we propose a Hidden State Attention (HSA) module to mitigate artifacts in real-world video super-resolution. Specifically, we first adopt various cheap filters to produce a hidden state pool. For example, Gaussian blur filters are for smoothing artifacts while sharpening filters are for enhancing details. To aggregate a new hidden state that contains fewer artifacts from the hidden state pool, we devise a Selective Cross Attention (SCA) module, in which the attention between input features and each hidden state is calculated. Equipped with HSA, our proposed method, namely FastRealVSR, is able to achieve 2x speedup while obtaining better performance than Real-BasicVSR. Codes will be available at https://github.com/TencentARC/FastRealVSR.

Downloads

Published

2023-06-26

How to Cite

Xie, L., Wang, X., Shi, S., Gu, J., Dong, C., & Shan, Y. (2023). Mitigating Artifacts in Real-World Video Super-resolution Models. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 2956-2964. https://doi.org/10.1609/aaai.v37i3.25398

Issue

Section

AAAI Technical Track on Computer Vision III