OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation
About
The scarcity of ground-truth labels poses one major challenge in developing optical flow estimation models that are both generalizable and robust. While current methods rely on data augmentation, they have yet to fully exploit the rich information available in labeled video sequences. We propose OCAI, a method that supports robust frame interpolation by generating intermediate video frames alongside optical flows in between. Utilizing a forward warping approach, OCAI employs occlusion awareness to resolve ambiguities in pixel values and fills in missing values by leveraging the forward-backward consistency of optical flows. Additionally, we introduce a teacher-student style semi-supervised learning method on top of the interpolated frames. Using a pair of unlabeled frames and the teacher model's predicted optical flow, we generate interpolated frames and flows to train a student model. The teacher's weights are maintained using Exponential Moving Averaging of the student. Our evaluations demonstrate perceptually superior interpolation quality and enhanced optical flow accuracy on established benchmarks such as Sintel and KITTI.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Optical Flow | Sintel (test) | AEPE (Final)2.63 | 120 | |
| Optical Flow | Sintel Final (train) | EPE2.32 | 92 | |
| Optical Flow | Sintel Clean (train) | EPE1.2 | 85 | |
| Optical Flow | KITTI (train) | Fl-all0.076 | 63 | |
| Optical Flow | KITTI (test) | -- | 28 | |
| Video Frame Interpolation | KITTI 5FPS -> 10 FPS (test) | PSNR22.08 | 8 | |
| Video Frame Interpolation | Sintel Clean 6FPS -> 12 FPS (test) | PSNR25.88 | 8 | |
| Video Frame Interpolation | Sintel Clean 12FPS -> 24 FPS (test) | PSNR29.47 | 8 | |
| Optical Flow | SlowFlow 100px 3BD (train) | EPE2.97 | 3 | |
| Optical Flow | SlowFlow (100px) 5BD (train) | Endpoint Error (EPE)5.04 | 3 |