Arbitrary-Scale Image Synthesis

About

Positional encodings have enabled recent works to train a single adversarial network that can generate images of different scales. However, these approaches are either limited to a set of discrete scales or struggle to maintain good perceptual quality at the scales for which the model is not trained explicitly. We propose the design of scale-consistent positional encodings invariant to our generator's layers transformations. This enables the generation of arbitrary-scale images even at scales unseen during training. Moreover, we incorporate novel inter-scale augmentations into our pipeline and partial generation training to facilitate the synthesis of consistent images at arbitrary scales. Lastly, we show competitive results for a continuum of scales on various commonly used datasets for image synthesis.

Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Radu Timofte, Martin Danelljan, Luc Van Gool• 2022

Related benchmarks

Task	Dataset	Result
Multi-scale Image Generation	FFHQ 1024x1024 (test)	Self-SSIM0.8942	24
Image Generation	LSUN bedroom	FID9.85	9
Image Synthesis	FFHQ 1024 (test)	FID (50k)10.91	9
Image Synthesis	FFHQ 512 (test)	FID6.23	3

Showing 4 of 4 rows

Other info

Code

Follow for update

@wizwand_team Discord