Video Enhancement with Task-Oriented Flow
About
Many video enhancement algorithms rely on optical flow to register frames in a video sequence. Precise flow estimation is however intractable; and optical flow itself is often a sub-optimal representation for particular video processing tasks. In this paper, we propose task-oriented flow (TOFlow), a motion representation learned in a self-supervised, task-specific manner. We design a neural network with a trainable motion estimation component and a video processing component, and train them jointly to learn the task-oriented flow. For evaluation, we build Vimeo-90K, a large-scale, high-quality video dataset for low-level video processing. TOFlow outperforms traditional optical flow on standard benchmarks as well as our Vimeo-90K dataset in three video processing tasks: frame interpolation, video denoising/deblocking, and video super-resolution.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Super-Resolution | Vid4 (test) | PSNR25.89 | 173 | |
| Video Frame Interpolation | Vimeo90K (test) | PSNR33.73 | 131 | |
| Video Frame Interpolation | UCF101 | PSNR34.58 | 117 | |
| Video Super-Resolution | REDS4 (test) | PSNR (Avg)27.98 | 117 | |
| Video Super-Resolution | REDS4 4x (test) | PSNR27.98 | 96 | |
| Video Super-Resolution | REDS4 | SSIM0.799 | 82 | |
| Video Super-Resolution | Vimeo-90K-T (test) | PSNR34.83 | 82 | |
| Video Interpolation | UCF-101 (test) | PSNR34.58 | 65 | |
| Video Frame Interpolation | Vimeo90K | PSNR33.73 | 62 | |
| Video Frame Interpolation | SNU-FILM Easy | PSNR39.08 | 59 |