Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution

About

Video super-resolution (VSR) aims to utilize multiple low-resolution frames to generate a high-resolution prediction for each frame. In this process, inter- and intra-frames are the key sources for exploiting temporal and spatial information. However, there are a couple of limitations for existing VSR methods. First, optical flow is often used to establish temporal correspondence. But flow estimation itself is error-prone and affects recovery results. Second, similar patterns existing in natural images are rarely exploited for the VSR task. Motivated by these findings, we propose a temporal multi-correspondence aggregation strategy to leverage similar patches across frames, and a cross-scale nonlocal-correspondence aggregation scheme to explore self-similarity of images across scales. Based on these two new modules, we build an effective multi-correspondence aggregation network (MuCAN) for VSR. Our method achieves state-of-the-art results on multiple benchmark datasets. Extensive experiments justify the effectiveness of our method.

Wenbo Li, Xin Tao, Taian Guo, Lu Qi, Jiangbo Lu, Jiaya Jia• 2020

Related benchmarks

TaskDatasetResultRank
Video Super-ResolutionVid4 (test)
PSNR27.26
173
Video Super-ResolutionREDS4 (test)
PSNR (Avg)30.88
117
Video Super-ResolutionREDS4 4x (test)
PSNR30.88
96
Video Super-ResolutionREDS4
SSIM0.875
82
Video Super-ResolutionVimeo-90K-T (test)
PSNR37.32
82
Video Super-ResolutionVimeo-90K-T BI degradation (test)
PSNR37.32
47
Video Super-ResolutionREDS4 (val)
Average PSNR30.88
41
Video Super-ResolutionVid4
Average Y PSNR27.26
32
Video Super-ResolutionVimeo-90K-T 87 (test)
PSNR37.32
32
Video Super-ResolutionVimeo-90K-T BI degradation, Y channel (test)
PSNR37.32
30
Showing 10 of 24 rows

Other info

Follow for update