GA-Net: Guided Aggregation Net for End-to-end Stereo Matching

About

In stereo matching, matching cost aggregation is crucial for accurate disparity estimation in both traditional methods and deep neural network models. We propose two novel neural network layers, aimed at capturing whole-image and local cost dependencies, respectively. The first is a semi-global aggregation layer, a differentiable approximation of semi-global matching; the second is a local guided aggregation layer, which follows a traditional cost filtering strategy to refine thin structures. These two layers can replace the widely used 3D convolutional layer, which is computationally costly and memory-consuming because its computational and memory complexity is cubic. In experiments, we show that networks with a two-layer guided aggregation block easily outperform the state-of-the-art GC-Net, which has nineteen 3D convolutional layers. We also train a deep guided aggregation network (GA-Net) that achieves better accuracy than state-of-the-art methods on both the Scene Flow dataset and the KITTI benchmarks.
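For readers unfamiliar with semi-global matching, the sketch below shows the classical (non-learned) cost aggregation recurrence along a single left-to-right scanline path. It is not the GA-Net semi-global aggregation layer, which the abstract describes only as a differentiable approximation of this idea. The function name, the `(H, W, D)` cost volume layout, and the penalty values `p1` and `p2` are assumptions made for illustration.

```python
import numpy as np

def sgm_aggregate_left_to_right(cost, p1=1.0, p2=8.0):
    """Classical SGM aggregation along one path (left to right).

    cost: array of shape (H, W, D) holding matching costs per pixel
          and disparity candidate (layout assumed for this sketch).
    p1:   penalty for a disparity change of exactly one level.
    p2:   penalty for any larger disparity change.
    Returns an aggregated cost volume of the same shape.
    """
    cost = np.asarray(cost, dtype=np.float64)
    h, w, d = cost.shape
    agg = np.empty_like(cost)
    agg[:, 0, :] = cost[:, 0, :]  # first column has no predecessor

    for x in range(1, w):
        prev = agg[:, x - 1, :]                     # (H, D)
        prev_min = prev.min(axis=1, keepdims=True)  # (H, 1)

        # Predecessor cost at disparity d-1 and d+1, padded with +inf
        # where that neighbour does not exist.
        minus = np.full_like(prev, np.inf)
        minus[:, 1:] = prev[:, :-1]
        plus = np.full_like(prev, np.inf)
        plus[:, :-1] = prev[:, 1:]

        best = np.minimum(prev, minus + p1)
        best = np.minimum(best, plus + p1)
        best = np.minimum(best, prev_min + p2)

        # Subtracting prev_min keeps the values from growing along the path.
        agg[:, x, :] = cost[:, x, :] + best - prev_min
    return agg
```

A full semi-global matcher would run this recurrence along several path directions and sum the results before taking the per-pixel minimum over disparities; only one direction is shown here to keep the sketch short.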

Feihu Zhang, Victor Prisacariu, Ruigang Yang, Philip H.S. Torr • 2019

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Stereo Matching | KITTI 2015 (test) | D1 Error (Overall) | 1.81 | 233 |
| Stereo Matching | KITTI 2015 | D1 Error (All) | 1.81 | 118 |
| Stereo Matching | KITTI 2012 | Error Rate (3px, All) | 0.016 | 108 |
| Stereo Matching | KITTI 2012 (test) | Outlier Rate (3px, Noc) | 1.19 | 105 |
| Stereo Matching | Scene Flow (test) | EPE | 0.84 | 84 |
| Disparity Estimation | KITTI 2015 (test) | D1 Error (bg, all) | 1.55 | 77 |
| Stereo Matching | Middlebury (test) | -- | -- | 60 |
| Stereo Matching | ETH3D | bad 1.0 | 0.065 | 57 |
| Stereo Matching | Middlebury | Bad Pixel Rate (Thresh 2.0) | 18.79 | 53 |
| Stereo Matching | KITTI 2015 (all pixels) | D1 Error (Background) | 1.48 | 48 |
Showing 10 of 44 rows
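
The table reports metrics such as EPE and D1 error without defining them. As a rough guide, the sketch below computes both under their commonly used definitions: EPE as the mean absolute disparity error over valid pixels, and D1 as the fraction of valid pixels whose error exceeds both 3 px and 5% of the ground-truth disparity. The function names, the `gt > 0` validity convention, and the threshold defaults are assumptions for illustration; the exact evaluation protocol behind each row may differ.

```python
import numpy as np

def epe(pred, gt, valid=None):
    """Mean absolute disparity error (end-point error) over valid pixels."""
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    if valid is None:
        valid = gt > 0  # assumed convention: 0 marks missing ground truth
    return float(np.abs(pred - gt)[valid].mean())

def d1_error(pred, gt, valid=None, abs_thresh=3.0, rel_thresh=0.05):
    """Fraction of valid pixels counted as outliers.

    A pixel is an outlier when its error exceeds abs_thresh pixels
    and also exceeds rel_thresh times the ground-truth disparity.
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    if valid is None:
        valid = gt > 0
    err = np.abs(pred - gt)
    outlier = (err > abs_thresh) & (err > rel_thresh * np.abs(gt))
    return float(outlier[valid].mean())
```

Multiplying the value returned by `d1_error` by 100 gives a percentage.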
