Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Focus on defocus: bridging the synthetic to real domain gap for depth estimation

About

Data-driven depth estimation methods struggle with the generalization outside their training scenes due to the immense variability of the real-world scenes. This problem can be partially addressed by utilising synthetically generated images, but closing the synthetic-real domain gap is far from trivial. In this paper, we tackle this issue by using domain invariant defocus blur as direct supervision. We leverage defocus cues by using a permutation invariant convolutional neural network that encourages the network to learn from the differences between images with a different point of focus. Our proposed network uses the defocus map as an intermediate supervisory signal. We are able to train our model completely on synthetic data and directly apply it to a wide range of real-world images. We evaluate our model on synthetic and real datasets, showing compelling generalization results and state-of-the-art depth prediction.

Maxim Maximov, Kevin Galim, Laura Leal-Taix\'e• 2020

Related benchmarks

TaskDatasetResultRank
Depth EstimationNYU v2 (test)--
423
Depth EstimationNYU Depth V2
RMSE0.493
177
Depth PredictionSynthetic (test)
Delta 1 Accuracy65.7
9
Depth EstimationDDFF-12 (val)
MSE8.61e-4
6
Depth EstimationFoD500 (test)
MSE0.0218
6
Depth EstimationDDFF12
MSE8.60e-4
6
Depth EstimationDDFF 12-Scene
MSE9.10e-4
5
Depth EstimationNYU 45 Degrees (test)
RMSE0.073
5
Depth EstimationDefocusNet Regular (test)
δ1 Accuracy91.2
4
Depth EstimationDefocusNet (< 0.5m) (test)
Delta 1 Acc91.1
4
Showing 10 of 24 rows

Other info

Code

Follow for update