Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
About
Effectively extracting inter-frame motion and appearance information is important for video frame interpolation (VFI). Previous works either extract both types of information in a mixed way or elaborate separate modules for each type of information, which lead to representation ambiguity and low efficiency. In this paper, we propose a novel module to explicitly extract motion and appearance information via a unifying operation. Specifically, we rethink the information process in inter-frame attention and reuse its attention map for both appearance feature enhancement and motion information extraction. Furthermore, for efficient VFI, our proposed module could be seamlessly integrated into a hybrid CNN and Transformer architecture. This hybrid pipeline can alleviate the computational complexity of inter-frame attention as well as preserve detailed low-level structure information. Experimental results demonstrate that, for both fixed- and arbitrary-timestep interpolation, our method achieves state-of-the-art performance on various datasets. Meanwhile, our approach enjoys a lighter computation overhead over models with close performance. The source code and models are available at https://github.com/MCG-NJU/EMA-VFI.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Frame Interpolation | Vimeo90K (test) | PSNR36.64 | 131 | |
| Video Frame Interpolation | Vimeo90K | PSNR36.64 | 62 | |
| Video Frame Interpolation | SNU-FILM Extreme | PSNR25.69 | 59 | |
| Video Frame Interpolation | SNU-FILM Hard | PSNR30.94 | 59 | |
| Video Frame Interpolation | SNU-FILM Medium | PSNR36.09 | 59 | |
| Video Frame Interpolation | SNU-FILM Easy | PSNR39.98 | 59 | |
| Multi-frame Video Interpolation | X 4K (test) | PSNR31.46 | 43 | |
| Video Frame Interpolation | UCF101 (test) | PSNR35.48 | 41 | |
| Video Frame Interpolation | Xiph-2k | PSNR36.9 | 29 | |
| Video Frame Interpolation | X 2K (test) | PSNR32.85 | 29 |