Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAFT: Recurrent All-Pairs Field Transforms for Optical Flow

About

We introduce Recurrent All-Pairs Field Transforms (RAFT), a new deep network architecture for optical flow. RAFT extracts per-pixel features, builds multi-scale 4D correlation volumes for all pairs of pixels, and iteratively updates a flow field through a recurrent unit that performs lookups on the correlation volumes. RAFT achieves state-of-the-art performance. On KITTI, RAFT achieves an F1-all error of 5.10%, a 16% error reduction from the best published result (6.10%). On Sintel (final pass), RAFT obtains an end-point-error of 2.855 pixels, a 30% error reduction from the best published result (4.098 pixels). In addition, RAFT has strong cross-dataset generalization as well as high efficiency in inference time, training speed, and parameter count. Code is available at https://github.com/princeton-vl/RAFT.

Zachary Teed, Jia Deng• 2020

Related benchmarks

TaskDatasetResultRank
Optical Flow EstimationKITTI 2015 (train)
Fl-epe0.63
446
Stereo MatchingKITTI 2015 (test)
D1 Error (Overall)2.42
233
Optical Flow EstimationMPI Sintel Final (train)
Endpoint Error (EPE)2.71
215
Optical Flow EstimationMPI Sintel Clean (train)
EPE1.43
208
Optical FlowSintel (train)
AEPE (Clean)0.76
200
Optical Flow EstimationSintel Final (test)
EPE2.71
133
Optical Flow EstimationSintel clean (test)
EPE1.61
120
Optical FlowSintel (test)
AEPE (Final)2.86
120
Optical FlowKITTI 2012 (train)--
115
Optical FlowSintel Final (train)
EPE1.2
112
Showing 10 of 136 rows
...

Other info

Code

Follow for update