RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching

About

We introduce RAFT-Stereo, a new deep architecture for rectified stereo based on the optical flow network RAFT. We introduce multi-level convolutional GRUs, which more efficiently propagate information across the image. A modified version of RAFT-Stereo can perform accurate real-time inference. RAFT-stereo ranks first on the Middlebury leaderboard, outperforming the next best method on 1px error by 29% and outperforms all published work on the ETH3D two-view stereo benchmark. Code is available at https://github.com/princeton-vl/RAFT-Stereo.

Lahav Lipson, Zachary Teed, Jia Deng• 2021

Related benchmarks

Task	Dataset	Result
Stereo Matching	KITTI 2015 (test)	D1 Error (Overall)1.82	245
Optical Flow Estimation	Sintel Final (test)	--	146
Stereo Matching	KITTI 2015	D1 Error (All)1.82	142
Optical Flow	KITTI 2015 (test)	--	122
Stereo Matching	KITTI 2012	Error Rate (3px, All)1.66	108
Stereo Matching	KITTI 2012 (test)	Outlier Rate (3px, Noc)1.3	105
Stereo Matching	ETH3D	bad 1.00.033	95
Stereo Matching	Scene Flow (test)	EPE0.55	84
Stereo Matching	Middlebury	Bad Pixel Rate (Thresh 2.0)9.37	84
Stereo Matching	Middlebury (test)	EPE1.27	60

Showing 10 of 78 rows

...

Other info

Code

Follow for update

@wizwand_team Discord