SharinGAN: Combining Synthetic and Real Data for Unsupervised Geometry Estimation
About
We propose a novel method for combining synthetic and real images when training networks to determine geometric information from a single image. We suggest a method for mapping both image types into a single, shared domain. This is connected to a primary network for end-to-end training. Ideally, this results in images from two domains that present shared information to the primary network. Our experiments demonstrate significant improvements over the state-of-the-art in two important domains, surface normal estimation of human faces and monocular depth estimation for outdoor scenes, both in an unsupervised setting.
Koutilya PNVR, Hao Zhou, David Jacobs• 2020
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Monocular Depth Estimation | Make3D (test) | Abs Rel0.377 | 132 | |
| Monocular Depth Estimation | KITTI 80m maximum depth (Eigen) | Abs Rel0.116 | 126 | |
| Face Normal Estimation | Photoface (test) | MAE24 | 32 | |
| Depth Estimation | KITTI 50m cap (test) | Abs Rel0.109 | 24 | |
| Monocular Depth Estimation | KITTI Raw (KR) Eigen 80m (test) | Abs Rel Error0.116 | 20 | |
| Monocular Depth Estimation | KITTI 50m cap Eigen split (test) | Absolute Relative Error0.109 | 19 | |
| Depth Estimation | KITTI Eigen split 80m cap | Abs Rel Error0.116 | 18 | |
| Monocular Depth Estimation | KITTI Stereo 2015 (test) | Abs Rel0.092 | 9 | |
| Light classification | MultiPIE 15 (test) | Top-1 Accuracy81.83 | 2 |
Showing 9 of 9 rows