MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution

About

Video super-resolution (VSR) aims to utilize multiple low-resolution frames to generate a high-resolution prediction for each frame. In this process, inter- and intra-frames are the key sources for exploiting temporal and spatial information. However, there are a couple of limitations for existing VSR methods. First, optical flow is often used to establish temporal correspondence. But flow estimation itself is error-prone and affects recovery results. Second, similar patterns existing in natural images are rarely exploited for the VSR task. Motivated by these findings, we propose a temporal multi-correspondence aggregation strategy to leverage similar patches across frames, and a cross-scale nonlocal-correspondence aggregation scheme to explore self-similarity of images across scales. Based on these two new modules, we build an effective multi-correspondence aggregation network (MuCAN) for VSR. Our method achieves state-of-the-art results on multiple benchmark datasets. Extensive experiments justify the effectiveness of our method.

Wenbo Li, Xin Tao, Taian Guo, Lu Qi, Jiangbo Lu, Jiaya Jia• 2020

Related benchmarks

Task	Dataset	Result
Video Super-Resolution	REDS4 (test)	PSNR (Avg)30.88	231
Video Super-Resolution	Vid4 (test)	PSNR27.26	206
Video Super-Resolution	REDS4	SSIM0.875	118
Video Super-Resolution	REDS4 4x (test)	PSNR30.88	96
Video Super-Resolution	Vimeo-90K-T (test)	PSNR37.32	92
Video Super-Resolution	Vimeo-90K-T BI degradation (test)	PSNR37.32	47
Video Super-Resolution	Vid4	PSNR25.85	46
Video Super-Resolution	REDS4 (val)	Average PSNR30.88	41
Video Super-Resolution	Vimeo-90K-T 87 (test)	PSNR37.32	32
Video Super-Resolution	Vimeo-90K-T BI degradation, Y channel (test)	PSNR37.32	30

Showing 10 of 24 rows

Other info

Follow for update

@wizwand_team Discord