Fourier Spectrum Discrepancies in Deep Network Generated Images

About

Advancements in deep generative models such as generative adversarial networks and variational autoencoders have resulted in the ability to generate realistic images that are visually indistinguishable from real images, which raises concerns about their potential malicious usage. In this paper, we present an analysis of the high-frequency Fourier modes of real and deep network generated images and show that deep network generated images share an observable, systematic shortcoming in replicating the attributes of these high-frequency modes. Using this, we propose a detection method based on the frequency spectrum of the images which is able to achieve an accuracy of up to 99.2% in classifying real and deep network generated images from various GAN and VAE architectures on a dataset of 5000 images with as few as 8 training examples. Furthermore, we show the impact of image transformations such as compression, cropping, and resolution reduction on the classification accuracy and suggest a method for modifying the high-frequency attributes of deep network generated images to mimic real images.
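The abstract's central idea, comparing the high-frequency Fourier modes of images, can be illustrated with a short sketch. This is not the authors' pipeline: the function names `radial_spectrum` and `high_freq_slope`, the `cutoff` fraction, and the log-log line fit are assumptions chosen for illustration. The sketch azimuthally averages the 2D Fourier magnitude of a grayscale image and summarizes the high-frequency end of the spectrum with a single decay slope, which could then serve as a feature for a simple classifier.

```python
import numpy as np

def radial_spectrum(image):
    """Azimuthally averaged Fourier magnitude of a 2D grayscale image.

    Returns one value per integer wavenumber below the Nyquist radius.
    """
    img = np.asarray(image, dtype=float)
    h, w = img.shape
    # Shift the zero-frequency component to the centre of the array.
    mag = np.abs(np.fft.fftshift(np.fft.fft2(img)))
    cy, cx = h // 2, w // 2
    y, x = np.indices((h, w))
    # Integer radius (wavenumber bin) of every Fourier coefficient.
    r = np.hypot(y - cy, x - cx).astype(int)
    n = min(cy, cx)  # keep only bins fully covered by the grid
    sums = np.bincount(r.ravel(), weights=mag.ravel())
    counts = np.bincount(r.ravel())
    return sums[:n] / counts[:n]

def high_freq_slope(image, cutoff=0.75):
    """Slope of log-magnitude vs. log-wavenumber above `cutoff` * Nyquist.

    `cutoff` is a hypothetical parameter for this sketch, not a value
    taken from the paper.
    """
    profile = radial_spectrum(image)
    k = np.arange(len(profile))
    mask = k >= cutoff * len(profile)
    # Small epsilon avoids log(0) for spectra with empty bins.
    slope, _ = np.polyfit(np.log(k[mask]), np.log(profile[mask] + 1e-12), 1)
    return slope
```

A steeper (more negative) slope means the high-frequency content decays faster. Per-image features of this kind are cheap to compute, which fits a setting with only a handful of training examples.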

Tarik Dzanic, Karan Shah, Freddie Witherden • 2019

Related benchmarks

Task	Dataset	Result
Model Attribution	GM-FFHQ (test)	Accuracy 55.7	12
Model Attribution	GM-FFHQ to GM-CelebA-HQ	Accuracy 42.5	12
Model Attribution	GM-CIFAR10 (test)	Accuracy 56.123	12
Model Attribution	GM-CelebA (test)	Accuracy 61.6	12
Model Attribution	GM-CelebA to CIFAR10	Accuracy 54.7	12
Model Attribution	GM-CHQ (test)	Accuracy 56.9	12
Model Attribution	GM-CIFAR10 to GM-CelebA	Accuracy 56.9	12
Model Attribution	GM-CelebA-HQ to GM-FFHQ	Accuracy 45.2	12
Model Attribution	GM-GenImage	Accuracy 82.13	7
T2I model identification	Leaderboard dataset (unseen prompts)	Top-1 Acc 11.79	6
