Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

RAFT: Recurrent All-Pairs Field Transforms for Optical Flow

About

We introduce Recurrent All-Pairs Field Transforms (RAFT), a new deep network architecture for optical flow. RAFT extracts per-pixel features, builds multi-scale 4D correlation volumes for all pairs of pixels, and iteratively updates a flow field through a recurrent unit that performs lookups on the correlation volumes. RAFT achieves state-of-the-art performance. On KITTI, RAFT achieves an F1-all error of 5.10%, a 16% error reduction from the best published result (6.10%). On Sintel (final pass), RAFT obtains an end-point-error of 2.855 pixels, a 30% error reduction from the best published result (4.098 pixels). In addition, RAFT has strong cross-dataset generalization as well as high efficiency in inference time, training speed, and parameter count. Code is available at https://github.com/princeton-vl/RAFT.

Zachary Teed, Jia Deng• 2020

Related benchmarks

TaskDatasetResultRank
Optical Flow EstimationKITTI 2015 (train)
Fl-epe0.63
431
Optical Flow EstimationMPI Sintel Final (train)
Endpoint Error (EPE)2.71
209
Optical Flow EstimationMPI Sintel Clean (train)
EPE1.43
202
Optical FlowSintel (train)
AEPE (Clean)0.76
179
Optical FlowSintel (test)
AEPE (Final)2.86
120
Optical FlowKITTI 2012 (train)--
115
Optical Flow EstimationSintel clean (test)
EPE1.61
103
Optical Flow EstimationSintel Final (test)
EPE2.71
101
Optical FlowKITTI 2015 (test)
Fl Error (All)5.1
95
Optical FlowSintel Final (train)
EPE1.2
92
Showing 10 of 100 rows
...

Other info

Code

Follow for update