Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Wasserstein Distances for Stereo Disparity Estimation

About

Existing approaches to depth or disparity estimation output a distribution over a set of pre-defined discrete values. This leads to inaccurate results when the true depth or disparity does not match any of these values. The fact that this distribution is usually learned indirectly through a regression loss causes further problems in ambiguous regions around object boundaries. We address these issues using a new neural network architecture that is capable of outputting arbitrary depth values, and a new loss function that is derived from the Wasserstein distance between the true and the predicted distributions. We validate our approach on a variety of tasks, including stereo disparity and depth estimation, and the downstream 3D object detection. Our approach drastically reduces the error in ambiguous regions, especially around object boundaries that greatly affect the localization of objects in 3D, achieving the state-of-the-art in 3D object detection for autonomous driving. Our code will be available at https://github.com/Div99/W-Stereo-Disp.

Divyansh Garg, Yan Wang, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao• 2020

Related benchmarks

TaskDatasetResultRank
3D Object DetectionKITTI car (test)
AP3D (Easy)74.5
195
Stereo MatchingKITTI 2015 (test)
D1 Error (Overall)1.92
144
Bird's Eye View Object Detection (Car)KITTI (test)
APBEV (Easy) @IoU=0.783.32
59
3D Object Detection (Car)KITTI (test)
AP3D (Easy) @ IoU=0.774.52
36
3D Object DetectionKITTI official (test)
APBEV (Easy)83.32
19
2D Car DetectionKITTI (test)
AP2D Easy95.85
14
Stereo DisparityKITTI 2015
3PE (Non Occlusion Foreground)2.79
12
Stereo MatchingMiddlebury half resolution (train)
Cosine Similarity61
12
Stereo DisparityScene Flow
EPE0.7
11
Showing 9 of 9 rows

Other info

Code

Follow for update