HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching

About

This paper presents HITNet, a novel neural network architecture for real-time stereo matching. In contrast to many recent neural network approaches that operate on a full cost volume and rely on 3D convolutions, our approach does not explicitly build a volume. Instead, it relies on a fast multi-resolution initialization step and on differentiable 2D geometric propagation and warping mechanisms to infer disparity hypotheses. To achieve a high level of accuracy, our network not only reasons geometrically about disparities but also infers slanted plane hypotheses, which allow geometric warping and upsampling operations to be performed more accurately. Our architecture is inherently multi-resolution, allowing information to propagate across different levels. Multiple experiments prove the effectiveness of the proposed approach at a fraction of the computation required by state-of-the-art methods. At the time of writing, HITNet ranks 1st-3rd on all the metrics published on the ETH3D website for two-view stereo, ranks 1st on most of the metrics among all the end-to-end learning approaches on Middlebury-v3, and ranks 1st on the popular KITTI 2012 and 2015 benchmarks among the published methods faster than 100ms.
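The slanted plane idea in the abstract can be illustrated with a small sketch. It assumes that a tile hypothesis stores a disparity `d` at the tile centre plus two gradients `dx` and `dy`, and that upsampling evaluates that plane at every pixel of the tile. The function names, the 4x4 tile size, and the warping convention are illustrative assumptions, not the authors' implementation.

```python
def upsample_tile(d, dx, dy, tile_size=4):
    """Expand one tile hypothesis (d, dx, dy) into per-pixel disparities.

    Assumption: d is the disparity at the tile centre, and dx, dy are the
    disparity gradients along x and y, i.e. the tile is a slanted plane.
    """
    centre = (tile_size - 1) / 2.0
    return [
        [d + (x - centre) * dx + (y - centre) * dy for x in range(tile_size)]
        for y in range(tile_size)
    ]


def warp_column(x_left, disparity):
    """Column in the right image that corresponds to x_left in the left image.

    Assumption: rectified stereo pair, disparity measured as x_left - x_right.
    """
    return x_left - disparity


# A plane slanted along x: disparity changes by 0.5 px per pixel.
slanted = upsample_tile(d=10.0, dx=0.5, dy=0.0)
print(slanted[0])  # [9.25, 9.75, 10.25, 10.75]

# A fronto-parallel tile (zero gradients) gives a constant disparity.
flat = upsample_tile(d=10.0, dx=0.0, dy=0.0)
print(flat[0])  # [10.0, 10.0, 10.0, 10.0]
```

With zero gradients every pixel in the tile inherits the same disparity. With non-zero gradients each pixel gets its own sub-pixel value, which is why a slanted plane can make warping and upsampling more accurate on tilted surfaces.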

Vladimir Tankovich, Christian Häne, Yinda Zhang, Adarsh Kowdle, Sean Fanello, Sofien Bouaziz • 2020

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| Stereo Matching | KITTI 2015 (test) | D1 Error (Overall) | 0.0198 | 144 |
| Stereo Matching | KITTI 2015 | D1 Error (All) | 1.98 | 118 |
| Stereo Matching | KITTI 2012 | Error Rate (3px, Noc) | 0.0141 | 81 |
| Stereo Matching | KITTI 2012 (test) | Outlier Rate (3px, Noc) | 1.41 | 76 |
| Stereo Matching | Scene Flow (test) | EPE | 0.43 | 70 |
| Stereo Matching | ETH3D | bad 1.0 | 3.11 | 51 |
| Stereo Matching | Middlebury (test) | -- | -- | 47 |
| Stereo Matching | Scene Flow | EPE (px) | 0.55 | 40 |
| Stereo Matching | KITTI 2015 (all pixels) | D1 Error (Background) | 1.74 | 38 |
| Stereo Matching | Middlebury | Bad Pixel Rate (Thresh 2.0) | 12.8 | 34 |
Showing 10 of 26 rows
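
The metrics listed above follow common stereo benchmark conventions. As a rough sketch: EPE is the mean absolute disparity error in pixels, a "bad" rate is the percentage of pixels whose error exceeds a threshold, and the KITTI D1 error counts a pixel as an outlier when its error exceeds both 3 px and 5% of the ground-truth disparity. The function name, the flat-list inputs, and the strict inequalities are simplifying assumptions; each benchmark's own evaluation code also handles occlusion masks and invalid pixels.

```python
def stereo_metrics(pred, gt, bad_thresh=2.0):
    """Compute EPE, bad-pixel rate and D1 error for valid pixels.

    pred and gt are equal-length lists of disparities in pixels.
    Rates are returned as percentages.
    """
    errors = [abs(p - g) for p, g in zip(pred, gt)]
    n = len(errors)
    epe = sum(errors) / n
    bad = 100.0 * sum(e > bad_thresh for e in errors) / n
    # D1 outlier: error above 3 px AND above 5% of the true disparity.
    d1 = 100.0 * sum(e > 3.0 and e > 0.05 * g for e, g in zip(errors, gt)) / n
    return {"epe": epe, "bad": bad, "d1": d1}


pred = [10.0, 20.5, 33.0, 50.0]
gt = [10.0, 20.0, 30.0, 54.0]
print(stereo_metrics(pred, gt))
# {'epe': 1.875, 'bad': 50.0, 'd1': 25.0}
```

The table appears to list some rates as fractions (0.0198) and others as percentages (1.98), so check the scale before comparing rows.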
