Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective

About

Despite recent stereo matching networks achieving impressive performance given sufficient training data, they suffer from domain shifts and generalize poorly to unseen domains. We argue that maintaining feature consistency between matching pixels is a vital factor for promoting the generalization capability of stereo matching networks, which has not been adequately considered. Here we address this issue by proposing a simple pixel-wise contrastive learning across the viewpoints. The stereo contrastive feature loss function explicitly constrains the consistency between learned features of matching pixel pairs which are observations of the same 3D points. A stereo selective whitening loss is further introduced to better preserve the stereo feature consistency across domains, which decorrelates stereo features from stereo viewpoint-specific style information. Counter-intuitively, the generalization of feature consistency between two viewpoints in the same scene translates to the generalization of stereo matching performance to unseen domains. Our method is generic in nature as it can be easily embedded into existing stereo networks and does not require access to the samples in the target domain. When trained on synthetic data and generalized to four real-world testing sets, our method achieves superior performance over several state-of-the-art networks.

Jiawei Zhang, Xiang Wang, Xiao Bai, Chen Wang, Lei Huang, Yimin Chen, Lin Gu, Jun Zhou, Tatsuya Harada, Edwin R. Hancock• 2022

Related benchmarks

TaskDatasetResultRank
Stereo MatchingKITTI 2015 (test)--
144
Stereo MatchingKITTI 2012 (test)--
76
Stereo MatchingMiddlebury (test)
3PE10.2
47
Stereo MatchingETH3D (test)
Error Rate (Th=1.0)5.8
30
Stereo MatchingKITTI 15
D1 Error (%)5.3
27
Stereo MatchingETH3D (train)
Bad 1.0 Rate5.8
23
Stereo MatchingMiddlebury half resolution (test)
Threshold Error Rate10.2
19
Stereo MatchingMiddlebury quarter resolution (test)
Threshold Error Rate7.8
19
Stereo MatchingMiddlebury half resolution (train)
Cosine Similarity98
12
Stereo MatchingMiddlebury half resolution 2014 (train)
Error Rate (2px)10.2
12
Showing 10 of 11 rows

Other info

Code

Follow for update