Real-Time Intermediate Flow Estimation for Video Frame Interpolation
About
Real-time video frame interpolation (VFI) is very useful in video processing, media players, and display devices. We propose RIFE, a Real-time Intermediate Flow Estimation algorithm for VFI. To realize a high-quality flow-based VFI method, RIFE uses a neural network named IFNet that can estimate the intermediate flows end-to-end with much faster speed. A privileged distillation scheme is designed for stable IFNet training and improve the overall performance. RIFE does not rely on pre-trained optical flow models and can support arbitrary-timestep frame interpolation with the temporal encoding input. Experiments demonstrate that RIFE achieves state-of-the-art performance on several public benchmarks. Compared with the popular SuperSlomo and DAIN methods, RIFE is 4--27 times faster and produces better results. Furthermore, RIFE can be extended to wider applications thanks to temporal encoding. The code is available at https://github.com/megvii-research/ECCV2022-RIFE.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Camera pose estimation | Sintel | ATE0.292 | 203 | |
| Video Frame Interpolation | Vimeo90K (test) | PSNR36.13 | 153 | |
| Monocular Depth Estimation | Sintel | Abs Rel0.381 | 127 | |
| Video Frame Interpolation | UCF101 | PSNR35.41 | 122 | |
| Continuous spatio-temporal video super-resolution | GoPro 85 (out-of-distribution) | PSNR30.16 | 80 | |
| Depth Estimation | BONN | Abs Rel0.075 | 63 | |
| Video Frame Interpolation | Vimeo90K | PSNR36.1 | 62 | |
| Video Depth Estimation | TUM dynamics | Abs Rel0.11 | 61 | |
| Camera pose estimation | TUM | ATE0.006 | 59 | |
| Video Frame Interpolation | SNU-FILM Easy | PSNR40.02 | 59 |