Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Iterative Geometry Encoding Volume for Stereo Matching

About

Recurrent All-Pairs Field Transforms (RAFT) has shown great potentials in matching tasks. However, all-pairs correlations lack non-local geometry knowledge and have difficulties tackling local ambiguities in ill-posed regions. In this paper, we propose Iterative Geometry Encoding Volume (IGEV-Stereo), a new deep network architecture for stereo matching. The proposed IGEV-Stereo builds a combined geometry encoding volume that encodes geometry and context information as well as local matching details, and iteratively indexes it to update the disparity map. To speed up the convergence, we exploit GEV to regress an accurate starting point for ConvGRUs iterations. Our IGEV-Stereo ranks $1^{st}$ on KITTI 2015 and 2012 (Reflective) among all published methods and is the fastest among the top 10 methods. In addition, IGEV-Stereo has strong cross-dataset generalization as well as high inference efficiency. We also extend our IGEV to multi-view stereo (MVS), i.e. IGEV-MVS, which achieves competitive accuracy on DTU benchmark. Code is available at https://github.com/gangweiX/IGEV.

Gangwei Xu, Xianqi Wang, Xiaohuan Ding, Xin Yang• 2023

Related benchmarks

TaskDatasetResultRank
Stereo MatchingKITTI 2015 (test)
D1 Error (Overall)1.59
144
Stereo MatchingKITTI 2015
D1 Error (All)1.59
118
Stereo MatchingKITTI 2012
Error Rate (3px, Noc)1.12
81
Stereo MatchingKITTI 2012 (test)
Outlier Rate (3px, Noc)1.12
76
Stereo MatchingScene Flow (test)
EPE0.47
70
Multi-view StereoDTU (test)--
61
Stereo MatchingETH3D
bad 1.00.036
51
Stereo MatchingMiddlebury (test)
3PE7.1
47
Stereo MatchingScene Flow
EPE (px)0.479
40
Stereo MatchingKITTI 2015 (all pixels)
D1 Error (Background)1.38
38
Showing 10 of 54 rows

Other info

Code

Follow for update