U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

About

We propose a novel method for unsupervised image-to-image translation, which incorporates a new attention module and a new learnable normalization function in an end-to-end manner. The attention module guides our model to focus on the more important regions that distinguish the source and target domains, based on the attention map obtained by the auxiliary classifier. Unlike previous attention-based methods, which cannot handle geometric changes between domains, our model can translate both images requiring holistic changes and images requiring large shape changes. Moreover, our new AdaLIN (Adaptive Layer-Instance Normalization) function helps our attention-guided model flexibly control the amount of change in shape and texture through parameters learned for each dataset. Experimental results show the superiority of the proposed method over existing state-of-the-art models with a fixed network architecture and hyper-parameters. Our code and datasets are available at https://github.com/taki0112/UGATIT or https://github.com/znxlwm/UGATIT-pytorch.
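
To make the AdaLIN idea above concrete, here is a minimal pure-Python sketch for a single sample. It blends instance-normalized and layer-normalized activations with a per-channel ratio rho clipped to [0, 1], then applies a scale gamma and shift beta. The function name `adalin`, the list-of-channels layout, and the use of population variance are assumptions made for illustration; in the actual model gamma and beta are produced by the network and rho is a learned parameter, and the linked repositories may differ in details.

```python
import math


def _mean_var(values):
    # Population mean and variance (a simplification; a framework
    # implementation may use the unbiased variance estimator instead).
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, var


def _normalize(values, mean, var, eps):
    return [(v - mean) / math.sqrt(var + eps) for v in values]


def adalin(x, gamma, beta, rho, eps=1e-5):
    """Illustrative AdaLIN for one sample.

    x:     list of C channels, each a flat list of H*W activations
    gamma: per-channel scale (hypothetical inputs here)
    beta:  per-channel shift (hypothetical inputs here)
    rho:   per-channel blend ratio; 1 -> instance norm, 0 -> layer norm
    """
    # Layer-norm statistics are shared across all channels of the sample.
    flat = [v for channel in x for v in channel]
    layer_mean, layer_var = _mean_var(flat)

    out = []
    for channel, g, b, r in zip(x, gamma, beta, rho):
        r = min(1.0, max(0.0, r))  # keep the blend ratio inside [0, 1]
        # Instance-norm statistics are computed per channel.
        inst_mean, inst_var = _mean_var(channel)
        x_in = _normalize(channel, inst_mean, inst_var, eps)
        x_ln = _normalize(channel, layer_mean, layer_var, eps)
        out.append([g * (r * a + (1.0 - r) * l) + b
                    for a, l in zip(x_in, x_ln)])
    return out


if __name__ == "__main__":
    feature_map = [[1.0, 2.0, 3.0, 4.0], [10.0, 20.0, 30.0, 40.0]]
    blended = adalin(feature_map, gamma=[1.0, 1.0], beta=[0.0, 0.0],
                     rho=[0.5, 0.5])
    print(blended)
```

With rho near 1 each channel is normalized on its own statistics; with rho near 0 all channels share one set of statistics, so differences between channels are preserved.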

Junho Kim, Minjae Kim, Hyeonwoo Kang, Kwanghee Lee • 2019

Related benchmarks

Task	Dataset	Result
Image-to-Image Translation	Retinal Fundus-to-Angiogram (test)	FID 24.5	42
Image-to-Image Translation	CD3 (test)	PSNR 19.21	28
Virtual Staining	IHC(CK8/18) (test)	PSNR 19.82	27
Virtual Staining	HEMIT 13 (full dataset)	PSNR 25.19	24
Image-to-Image Translation	Edges to Rotated Shoes (test)	LPIPS 0.56	12
Image-to-Image Translation	selfie2anime	KID 0.1161	11
Image-to-Image Translation	anime2selfie	KID 0.1152	10
Image-to-Image Translation	portrait2photo	KID 1.69	10
Sketch-to-Photo Generation	Chair V2	FID 107.2	8
Sketch-to-Photo Generation	Handbag	FID 127.5	8

Showing 10 of 48 rows
