SharinGAN: Combining Synthetic and Real Data for Unsupervised Geometry Estimation

About

We propose a novel method for combining synthetic and real images when training networks to determine geometric information from a single image. We suggest a method for mapping both image types into a single, shared domain. This is connected to a primary network for end-to-end training. Ideally, this results in images from two domains that present shared information to the primary network. Our experiments demonstrate significant improvements over the state-of-the-art in two important domains, surface normal estimation of human faces and monocular depth estimation for outdoor scenes, both in an unsupervised setting.

Koutilya PNVR, Hao Zhou, David Jacobs• 2020

Related benchmarks

Task	Dataset	Result
Monocular Depth Estimation	Make3D (test)	Abs Rel0.377	143
Monocular Depth Estimation	KITTI 80m maximum depth (Eigen)	Abs Rel0.116	126
Face Normal Estimation	Photoface (test)	MAE24	32
Depth Estimation	KITTI 50m cap (test)	Abs Rel0.109	24
Monocular Depth Estimation	KITTI Raw (KR) Eigen 80m (test)	Abs Rel Error0.116	20
Monocular Depth Estimation	KITTI 50m cap Eigen split (test)	Absolute Relative Error0.109	19
Depth Estimation	KITTI Eigen split 80m cap	Abs Rel Error0.116	18
Monocular Depth Estimation	KITTI Stereo 2015 (test)	Abs Rel0.092	9
Light classification	MultiPIE 15 (test)	Top-1 Accuracy81.83	2

Showing 9 of 9 rows

Other info

Code

Follow for update

@wizwand_team Discord