Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching

About

We introduce RAFT-Stereo, a new deep architecture for rectified stereo based on the optical flow network RAFT. We introduce multi-level convolutional GRUs, which more efficiently propagate information across the image. A modified version of RAFT-Stereo can perform accurate real-time inference. RAFT-stereo ranks first on the Middlebury leaderboard, outperforming the next best method on 1px error by 29% and outperforms all published work on the ETH3D two-view stereo benchmark. Code is available at https://github.com/princeton-vl/RAFT-Stereo.

Lahav Lipson, Zachary Teed, Jia Deng• 2021

Related benchmarks

TaskDatasetResultRank
Stereo MatchingKITTI 2015 (test)
D1 Error (Overall)1.82
205
Optical Flow EstimationSintel Final (test)--
133
Stereo MatchingKITTI 2015
D1 Error (All)1.82
118
Optical FlowKITTI 2015 (test)--
109
Stereo MatchingKITTI 2012
Error Rate (3px, All)1.66
108
Stereo MatchingKITTI 2012 (test)
Outlier Rate (3px, Noc)1.3
89
Stereo MatchingScene Flow (test)
EPE0.55
77
Stereo MatchingETH3D
bad 1.00.033
51
Stereo MatchingETH3D
Threshold Error > 1px (Noc)2.24
50
Stereo MatchingKITTI 2015 (all pixels)
D1 Error (Background)1.58
48
Showing 10 of 69 rows

Other info

Code

Follow for update