Mitigating Artifacts in Real-World Video Super-resolution Models
DOI:
https://doi.org/10.1609/aaai.v37i3.25398Keywords:
CV: Low Level & Physics-Based VisionAbstract
The recurrent structure is a prevalent framework for the task of video super-resolution, which models the temporal dependency between frames via hidden states. When applied to real-world scenarios with unknown and complex degradations, hidden states tend to contain unpleasant artifacts and propagate them to restored frames. In this circumstance, our analyses show that such artifacts can be largely alleviated when the hidden state is replaced with a cleaner counterpart. Based on the observations, we propose a Hidden State Attention (HSA) module to mitigate artifacts in real-world video super-resolution. Specifically, we first adopt various cheap filters to produce a hidden state pool. For example, Gaussian blur filters are for smoothing artifacts while sharpening filters are for enhancing details. To aggregate a new hidden state that contains fewer artifacts from the hidden state pool, we devise a Selective Cross Attention (SCA) module, in which the attention between input features and each hidden state is calculated. Equipped with HSA, our proposed method, namely FastRealVSR, is able to achieve 2x speedup while obtaining better performance than Real-BasicVSR. Codes will be available at https://github.com/TencentARC/FastRealVSR.Downloads
Published
2023-06-26
How to Cite
Xie, L., Wang, X., Shi, S., Gu, J., Dong, C., & Shan, Y. (2023). Mitigating Artifacts in Real-World Video Super-resolution Models. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 2956-2964. https://doi.org/10.1609/aaai.v37i3.25398
Issue
Section
AAAI Technical Track on Computer Vision III