Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching

About

We introduce RAFT-Stereo, a new deep architecture for rectified stereo based on the optical flow network RAFT. We introduce multi-level convolutional GRUs, which more efficiently propagate information across the image. A modified version of RAFT-Stereo can perform accurate real-time inference. RAFT-stereo ranks first on the Middlebury leaderboard, outperforming the next best method on 1px error by 29% and outperforms all published work on the ETH3D two-view stereo benchmark. Code is available at https://github.com/princeton-vl/RAFT-Stereo.

Lahav Lipson, Zachary Teed, Jia Deng• 2021

Related benchmarks

TaskDatasetResultRank
Stereo MatchingKITTI 2015 (test)
D1 Error (Overall)1.82
233
Optical Flow EstimationSintel Final (test)--
133
Stereo MatchingKITTI 2015
D1 Error (All)1.82
118
Optical FlowKITTI 2015 (test)--
109
Stereo MatchingKITTI 2012
Error Rate (3px, All)1.66
108
Stereo MatchingKITTI 2012 (test)
Outlier Rate (3px, Noc)1.3
105
Stereo MatchingScene Flow (test)
EPE0.55
84
Stereo MatchingMiddlebury (test)
EPE1.27
60
Stereo MatchingETH3D
bad 1.00.033
57
Stereo MatchingMiddlebury
Bad Pixel Rate (Thresh 2.0)9.37
53
Showing 10 of 73 rows
...

Other info

Code

Follow for update