Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation
About
Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while maintaining real-time speed. Specifically, we discuss the use of early fusion, slow fusion and 3D convolutions for the joint processing of multiple consecutive video frames. We also propose a novel joint motion compensation and video super-resolution algorithm that is orders of magnitude more efficient than competing methods, relying on a fast multi-resolution spatial transformer module that is end-to-end trainable. These contributions provide both higher accuracy and temporally more consistent videos, which we confirm qualitatively and quantitatively. Relative to single-frame models, spatio-temporal networks can either reduce the computational cost by 30% whilst maintaining the same quality or provide a 0.2dB gain for a similar computational cost. Results on publicly available datasets demonstrate that the proposed algorithms surpass current state-of-the-art performance in both accuracy and efficiency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Super-Resolution | Vid4 (test) | PSNR27.49 | 173 | |
| Video Super-Resolution | REDS4 4x (test) | PSNR31.67 | 96 | |
| Video Super-Resolution | Vid4 | Average Y PSNR24.95 | 32 | |
| Video Super-Resolution | Vid4 BI degradation (test) | PSNR25.35 | 31 | |
| 4x Video Super-Resolution | Vimeo-90K-T (test) | PSNR37.47 | 28 | |
| Video Super-Resolution | Vid4 4x (test) | PSNR27.39 | 19 | |
| Video Super-Resolution | Vid4 x4 upscaling (test) | PSNR25.35 | 18 | |
| Video Super-Resolution | Vid4 Scale x4 (test) | PSNR (City)26.17 | 16 | |
| Video Super-Resolution | SPMCS-11 | PSNR27.09 | 15 | |
| 4x Video Super-Resolution | REDS 4 | PSNR31.67 | 12 |