RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
About
We introduce Recurrent All-Pairs Field Transforms (RAFT), a new deep network architecture for optical flow. RAFT extracts per-pixel features, builds multi-scale 4D correlation volumes for all pairs of pixels, and iteratively updates a flow field through a recurrent unit that performs lookups on the correlation volumes. RAFT achieves state-of-the-art performance. On KITTI, RAFT achieves an F1-all error of 5.10%, a 16% error reduction from the best published result (6.10%). On Sintel (final pass), RAFT obtains an end-point-error of 2.855 pixels, a 30% error reduction from the best published result (4.098 pixels). In addition, RAFT has strong cross-dataset generalization as well as high efficiency in inference time, training speed, and parameter count. Code is available at https://github.com/princeton-vl/RAFT.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Optical Flow Estimation | KITTI 2015 (train) | Fl-epe0.63 | 431 | |
| Optical Flow Estimation | MPI Sintel Final (train) | Endpoint Error (EPE)2.71 | 209 | |
| Optical Flow Estimation | MPI Sintel Clean (train) | EPE1.43 | 202 | |
| Optical Flow | Sintel (train) | AEPE (Clean)0.76 | 179 | |
| Optical Flow | Sintel (test) | AEPE (Final)2.86 | 120 | |
| Optical Flow | KITTI 2012 (train) | -- | 115 | |
| Optical Flow Estimation | Sintel clean (test) | EPE1.61 | 103 | |
| Optical Flow Estimation | Sintel Final (test) | EPE2.71 | 101 | |
| Optical Flow | KITTI 2015 (test) | Fl Error (All)5.1 | 95 | |
| Optical Flow | Sintel Final (train) | EPE1.2 | 92 |