Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching

About

End-to-end deep-learning networks recently demonstrated extremely good perfor- mance for stereo matching. However, existing networks are difficult to use for practical applications since (1) they are memory-hungry and unable to process even modest-size images, (2) they have to be trained for a given disparity range. The Practical Deep Stereo (PDS) network that we propose addresses both issues: First, its architecture relies on novel bottleneck modules that drastically reduce the memory footprint in inference, and additional design choices allow to handle greater image size during training. This results in a model that leverages large image context to resolve matching ambiguities. Second, a novel sub-pixel cross- entropy loss combined with a MAP estimator make this network less sensitive to ambiguous matches, and applicable to any disparity range without re-training. We compare PDS to state-of-the-art methods published over the recent months, and demonstrate its superior performance on FlyingThings3D and KITTI sets.

Stepan Tulyakov, Anton Ivanov, Francois Fleuret• 2018

Related benchmarks

Task	Dataset	Result
Stereo Matching	KITTI 2015 (test)	D1 Error (Overall)2.58	233
Stereo Matching	KITTI 2012 (test)	Outlier Rate (3px, Noc)1.92	105
Disparity Estimation	Scene Flow (test)	EPE1.12	24

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord