Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Fourier Spectrum Discrepancies in Deep Network Generated Images

About

Advancements in deep generative models such as generative adversarial networks and variational autoencoders have resulted in the ability to generate realistic images that are visually indistinguishable from real images, which raises concerns about their potential malicious usage. In this paper, we present an analysis of the high-frequency Fourier modes of real and deep network generated images and show that deep network generated images share an observable, systematic shortcoming in replicating the attributes of these high-frequency modes. Using this, we propose a detection method based on the frequency spectrum of the images which is able to achieve an accuracy of up to 99.2% in classifying real and deep network generated images from various GAN and VAE architectures on a dataset of 5000 images with as few as 8 training examples. Furthermore, we show the impact of image transformations such as compression, cropping, and resolution reduction on the classification accuracy and suggest a method for modifying the high-frequency attributes of deep network generated images to mimic real images.

Tarik Dzanic, Karan Shah, Freddie Witherden• 2019

Related benchmarks

TaskDatasetResultRank
Model AttributionGM-FFHQ (test)
Accuracy55.7
12
Model AttributionGM-FFHQ to GM-CelebA-HQ
Accuracy42.5
12
Model AttributionGM-CIFAR10 (test)
Accuracy56.123
12
Model AttributionGM-CelebA (test)
Accuracy61.6
12
Model AttributionGM-CelebA to CIFAR10
Accuracy54.7
12
Model AttributionGM-CHQ (test)
Accuracy56.9
12
Model AttributionGM-CIFAR10 to GM-CelebA
Accuracy56.9
12
Model AttributionGM-CelebA-HQ to GM-FFHQ
Accuracy45.2
12
T2I model identificationLeaderboard dataset (unseen prompts)
Top-1 Acc11.79
6
Showing 9 of 9 rows

Other info

Follow for update