Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAFT: Recurrent All-Pairs Field Transforms for Optical Flow

About

We introduce Recurrent All-Pairs Field Transforms (RAFT), a new deep network architecture for optical flow. RAFT extracts per-pixel features, builds multi-scale 4D correlation volumes for all pairs of pixels, and iteratively updates a flow field through a recurrent unit that performs lookups on the correlation volumes. RAFT achieves state-of-the-art performance. On KITTI, RAFT achieves an F1-all error of 5.10%, a 16% error reduction from the best published result (6.10%). On Sintel (final pass), RAFT obtains an end-point-error of 2.855 pixels, a 30% error reduction from the best published result (4.098 pixels). In addition, RAFT has strong cross-dataset generalization as well as high efficiency in inference time, training speed, and parameter count. Code is available at https://github.com/princeton-vl/RAFT.

Zachary Teed, Jia Deng• 2020

Related benchmarks

TaskDatasetResultRank
Optical Flow EstimationKITTI 2015 (train)
Fl-epe0.63
446
Optical Flow EstimationMPI Sintel Final (train)
Endpoint Error (EPE)2.71
209
Stereo MatchingKITTI 2015 (test)
D1 Error (Overall)2.42
205
Optical Flow EstimationMPI Sintel Clean (train)
EPE1.43
202
Optical FlowSintel (train)
AEPE (Clean)0.76
200
Optical Flow EstimationSintel Final (test)
EPE2.71
133
Optical Flow EstimationSintel clean (test)
EPE1.61
120
Optical FlowSintel (test)
AEPE (Final)2.86
120
Optical FlowKITTI 2012 (train)--
115
Optical FlowKITTI 2015 (test)
Fl Error (All)5.1
109
Showing 10 of 132 rows
...

Other info

Code

Follow for update