Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching

About

End-to-end deep-learning networks recently demonstrated extremely good perfor- mance for stereo matching. However, existing networks are difficult to use for practical applications since (1) they are memory-hungry and unable to process even modest-size images, (2) they have to be trained for a given disparity range. The Practical Deep Stereo (PDS) network that we propose addresses both issues: First, its architecture relies on novel bottleneck modules that drastically reduce the memory footprint in inference, and additional design choices allow to handle greater image size during training. This results in a model that leverages large image context to resolve matching ambiguities. Second, a novel sub-pixel cross- entropy loss combined with a MAP estimator make this network less sensitive to ambiguous matches, and applicable to any disparity range without re-training. We compare PDS to state-of-the-art methods published over the recent months, and demonstrate its superior performance on FlyingThings3D and KITTI sets.

Stepan Tulyakov, Anton Ivanov, Francois Fleuret• 2018

Related benchmarks

TaskDatasetResultRank
Stereo MatchingKITTI 2015 (test)
D1 Error (Overall)2.58
144
Stereo MatchingKITTI 2012 (test)
Outlier Rate (3px, Noc)1.92
76
Disparity EstimationScene Flow (test)
EPE1.12
24
Showing 3 of 3 rows

Other info

Follow for update