Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reversing the cycle: self-supervised deep stereo through enhanced monocular distillation

About

In many fields, self-supervised learning solutions are rapidly evolving and filling the gap with supervised approaches. This fact occurs for depth estimation based on either monocular or stereo, with the latter often providing a valid source of self-supervision for the former. In contrast, to soften typical stereo artefacts, we propose a novel self-supervised paradigm reversing the link between the two. Purposely, in order to train deep stereo networks, we distill knowledge through a monocular completion network. This architecture exploits single-image clues and few sparse points, sourced by traditional stereo algorithms, to estimate dense yet accurate disparity maps by means of a consensus mechanism over multiple estimations. We thoroughly evaluate with popular stereo datasets the impact of different supervisory signals showing how stereo networks trained with our paradigm outperform existing self-supervised frameworks. Finally, our proposal achieves notable generalization capabilities dealing with domain shift issues. Code available at https://github.com/FilippoAleotti/Reversing

Filippo Aleotti, Fabio Tosi, Li Zhang, Matteo Poggi, Stefano Mattoccia• 2020

Related benchmarks

TaskDatasetResultRank
Stereo MatchingKITTI 15
D1 Error (%)4.09
27
Stereo Depth EstimationKITTI 2015 (train)
Acc Threshold 197.4
12
Stereo Depth EstimationKITTI 2015 (test)
D1 Error (noc)3.86
6
Showing 3 of 3 rows

Other info

Follow for update