WAFT: Warping-Alone Field Transforms for Optical Flow
About
We introduce Warping-Alone Field Transforms (WAFT), a simple and effective method for optical flow. WAFT is similar to RAFT but replaces the cost volume with high-resolution warping, achieving better accuracy at lower memory cost. This design challenges the conventional wisdom that constructing cost volumes is necessary for strong performance. WAFT is a simple and flexible meta-architecture with minimal inductive biases and little reliance on custom designs. WAFT ranks 1st on the Spring, Sintel, and KITTI benchmarks and achieves the best zero-shot generalization on KITTI, while being 1.3-4.1x faster than existing methods of competitive accuracy (e.g., 1.3x faster than Flowformer++ and 4.1x faster than CCMR+). Code and model weights are available at https://github.com/princeton-vl/WAFT.
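To make the warping idea concrete, here is a minimal NumPy sketch of backward warping: the second frame's feature map is bilinearly sampled at the positions the current flow estimate points to, so that it lines up with the first frame. This is an illustration only, not the authors' implementation. The function name `backward_warp`, the `(H, W, C)` array layout, and the border clamping are all assumptions made for this example.

```python
import numpy as np

def backward_warp(feat2, flow):
    """Bilinearly sample feat2 at (x + u, y + v) for every pixel (x, y).

    feat2: (H, W, C) feature map of the second frame.
    flow:  (H, W, 2) flow estimate; flow[..., 0] is u (x), flow[..., 1] is v (y).
    Returns an (H, W, C) array aligned to the first frame.
    """
    H, W, _ = feat2.shape
    ys, xs = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")

    # Sampling coordinates, clamped to the image border (an assumption here;
    # zero padding is another common choice).
    x = np.clip(xs + flow[..., 0], 0, W - 1)
    y = np.clip(ys + flow[..., 1], 0, H - 1)

    # Integer corners surrounding each sampling point.
    x0 = np.floor(x).astype(int)
    y0 = np.floor(y).astype(int)
    x1 = np.minimum(x0 + 1, W - 1)
    y1 = np.minimum(y0 + 1, H - 1)

    # Fractional offsets, broadcast over the channel axis.
    wx = (x - x0)[..., None]
    wy = (y - y0)[..., None]

    # Interpolate along x on the two rows, then along y.
    top = (1 - wx) * feat2[y0, x0] + wx * feat2[y0, x1]
    bottom = (1 - wx) * feat2[y1, x0] + wx * feat2[y1, x1]
    return (1 - wy) * top + wy * bottom
```

The warped output has the same `H x W x C` shape as the input features, whereas an all-pairs cost volume of the kind RAFT builds stores one correlation per pair of pixels. How WAFT consumes the warped features in its update step is not covered by this sketch.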
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Optical Flow Estimation | KITTI 2015 (train) | Fl-epe 1.15 | 446 |
| Optical Flow | Sintel (train) | AEPE (Clean) 1.28 | 200 |
| Optical Flow Estimation | Sintel Final (test) | EPE 2.02 | 133 |
| Optical Flow Estimation | Sintel clean (test) | EPE 0.95 | 120 |
| Optical Flow Estimation | KITTI 2015 (test) | Fl-all 3.56 | 108 |
| Optical Flow | KITTI (train) | Fl-all 0.129 | 84 |
| Optical Flow Estimation | KITTI 2015 | Fl-all 3.31 | 60 |
| Optical Flow | Sintel Clean | EPE 0.94 | 59 |
| Optical Flow | Sintel Final | EPE 2.02 | 59 |
| Point Tracking | DAVIS TAP-Vid | -- | 52 |