Geometry-Consistent Generative Adversarial Networks for One-Sided Unsupervised Domain Mapping

About

Unsupervised domain mapping aims to learn a function GXY that translates domain X to domain Y in the absence of paired examples. Finding the optimal GXY without paired data is an ill-posed problem, so appropriate constraints are required to obtain reasonable solutions. One of the most prominent constraints is cycle consistency, which requires that an image translated by GXY be mapped back to the input image by an inverse mapping GYX. While cycle consistency requires the simultaneous training of GXY and GYX, recent studies have shown that one-sided domain mapping can be achieved by preserving pairwise distances between images. Although cycle consistency and distance preservation successfully constrain the solution space, they overlook the special property that simple geometric transformations do not change the semantic structure of images. Based on this property, we develop a geometry-consistent generative adversarial network (GcGAN), which enables one-sided unsupervised domain mapping. GcGAN takes the original image and its counterpart transformed by a predefined geometric transformation as inputs, and generates two images in the new domain that are coupled by the corresponding geometry-consistency constraint. The geometry-consistency constraint reduces the space of possible solutions while keeping the correct solutions in the search space. Quantitative and qualitative comparisons with the baseline (GAN alone) and state-of-the-art methods, including CycleGAN and DistanceGAN, demonstrate the effectiveness of our method.
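The geometry-consistency idea described above can be illustrated with a small sketch. This is not the authors' implementation: it assumes the transformation is a 90-degree clockwise rotation, that the constraint is measured with a mean L1 distance, and that the two generators are plain Python callables. The names `g`, `g_tilde`, `rot90_cw`, `rot90_ccw`, and `geometry_consistency_loss` are hypothetical and chosen only for illustration.

```python
from typing import Callable, List

Image = List[List[float]]  # toy single-channel image as nested lists


def rot90_cw(img: Image) -> Image:
    """Rotate a 2-D image 90 degrees clockwise (the assumed transformation f)."""
    return [list(row) for row in zip(*img[::-1])]


def rot90_ccw(img: Image) -> Image:
    """Rotate a 2-D image 90 degrees counter-clockwise (inverse of rot90_cw)."""
    return [list(row) for row in zip(*img)][::-1]


def l1_mean(a: Image, b: Image) -> float:
    """Mean absolute difference between two images of the same shape."""
    diffs = [abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb)]
    return sum(diffs) / len(diffs)


def geometry_consistency_loss(
    x: Image,
    g: Callable[[Image], Image],        # hypothetical generator for original inputs
    g_tilde: Callable[[Image], Image],  # hypothetical generator for transformed inputs
    f: Callable[[Image], Image] = rot90_cw,
    f_inv: Callable[[Image], Image] = rot90_ccw,
) -> float:
    """Penalise disagreement between the two translations, up to the transformation f."""
    y = g(x)                 # translation of the original image
    y_tilde = g_tilde(f(x))  # translation of the transformed image
    # Undoing f on y_tilde should recover y, and applying f to y should give y_tilde.
    return l1_mean(y, f_inv(y_tilde)) + l1_mean(y_tilde, f(y))


def invert(img: Image) -> Image:
    """Toy pixel-wise 'generator': commutes with rotation."""
    return [[1.0 - p for p in row] for row in img]


def hflip(img: Image) -> Image:
    """Toy 'generator' that mirrors the image: does not commute with rotation."""
    return [row[::-1] for row in img]


x = [[1.0, 2.0], [3.0, 4.0]]
loss_consistent = geometry_consistency_loss(x, invert, invert)
loss_inconsistent = geometry_consistency_loss(x, hflip, hflip)
```

In this toy setting the pixel-wise mapping yields a zero penalty because it commutes with the rotation, while the mirroring mapping is penalised. In an actual GcGAN-style setup, a term of this kind would be combined with adversarial losses on both generated images.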

Huan Fu, Mingming Gong, Chaohui Wang, Kayhan Batmanghelich, Kun Zhang, Dacheng Tao • 2018

Related benchmarks

Task	Dataset	Result
Semantic Image Synthesis	ADE20K	FID 92	66
Semantic Image Synthesis	Cityscapes	FID 80	54
Semantic Image Synthesis	COCO Stuff	FID 99.8	49
Image-to-Image Translation	Horse -> Zebra	FID 74.89	23
Photo to label translation	Cityscapes	Pixel Acc 0.583	18
Unpaired Image-to-Image Translation	Cat → Dog v1 (test)	FID 96.6	14
Medical Image Synthesis	UPenn-GBM T1c	PSNR 30.75	8
Medical Image Synthesis	UPenn-GBM T2f	PSNR 30.41	8
Medical Image Synthesis	UPenn-GBM (Avg)	PSNR 30.58	8
Unpaired Image-to-Image Translation	Cityscapes	Pixel Accuracy 65.5	8

Showing 10 of 19 rows
