MaskFlownet: Asymmetric Feature Matching with Learnable Occlusion Mask
About
Feature warping is a core technique in optical flow estimation; however, the ambiguity caused by occluded areas during warping is a major problem that remains unsolved. In this paper, we propose an asymmetric occlusion-aware feature matching module, which can learn a rough occlusion mask that filters useless (occluded) areas immediately after feature warping without any explicit supervision. The proposed module can be easily integrated into end-to-end network architectures and enjoys performance gains while introducing negligible computational cost. The learned occlusion mask can be further fed into a subsequent network cascade with dual feature pyramids with which we achieve state-of-the-art performance. At the time of submission, our method, called MaskFlownet, surpasses all published optical flow methods on the MPI Sintel, KITTI 2012 and 2015 benchmarks. Code is available at https://github.com/microsoft/MaskFlownet.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Object Segmentation | DAVIS 2016 (val) | J Mean30 | 564 | |
| Optical Flow Estimation | KITTI 2015 (train) | -- | 431 | |
| Optical Flow Estimation | MPI Sintel Final (train) | Endpoint Error (EPE)3.61 | 209 | |
| Optical Flow Estimation | MPI Sintel Clean (train) | EPE2.25 | 202 | |
| Optical Flow | Sintel (train) | AEPE (Clean)2.25 | 179 | |
| Optical Flow | MPI Sintel Clean (test) | AEE2.52 | 158 | |
| Optical Flow | MPI-Sintel final (test) | -- | 137 | |
| Optical Flow | Sintel (test) | AEPE (Final)4.17 | 120 | |
| Optical Flow | KITTI 2012 (train) | AEE2.94 | 115 | |
| Optical Flow Estimation | Sintel clean (test) | EPE2.52 | 103 |