Contrastive Learning for Unpaired Image-to-Image Translation

About

In image-to-image translation, each patch in the output should reflect the content of the corresponding patch in the input, independent of domain. We propose a straightforward method for doing so -- maximizing mutual information between the two, using a framework based on contrastive learning. The method encourages two elements (corresponding patches) to map to a similar point in a learned feature space, relative to other elements (other patches) in the dataset, referred to as negatives. We explore several critical design choices for making contrastive learning effective in the image synthesis setting. Notably, we use a multilayer, patch-based approach, rather than operate on entire images. Furthermore, we draw negatives from within the input image itself, rather than from the rest of the dataset. We demonstrate that our framework enables one-sided translation in the unpaired image-to-image translation setting, while improving quality and reducing training time. In addition, our method can even be extended to the training setting where each "domain" is only a single image.

Taesung Park, Alexei A. Efros, Richard Zhang, Jun-Yan Zhu• 2020

Related benchmarks

Task	Dataset	Result
Semantic Image Synthesis	ADE20K	FID79.1	66
Object Detection	BDD100K (Nighttime)	AP14.1	66
Image Dehazing	SOTS outdoor RESIDE (test)	PSNR23.67	57
Semantic Image Synthesis	Cityscapes	FID57.3	54
Image Dehazing	SOTS indoor RESIDE (test)	PSNR24.3	49
Semantic Image Synthesis	COCO Stuff	FID85.6	49
Virtual Staining	MIST-HER2	SSIM0.1698	46
Brain Tissue Segmentation	iSeg 2019 (test)	Dice (CSF)94.44	28
Image-to-Image Translation	CD3 (test)	PSNR19.5	28
Virtual Staining	IHC(CK8/18) (test)	PSNR19.39	27

Showing 10 of 123 rows

...

Other info

Follow for update

@wizwand_team Discord