Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are Failing to Reproduce Spectral Distributions

About

Generative convolutional deep neural networks, e.g. popular GAN architectures, are relying on convolution based up-sampling methods to produce non-scalar outputs like images or video sequences. In this paper, we show that common up-sampling methods, i.e. known as up-convolution or transposed convolution, are causing the inability of such models to reproduce spectral distributions of natural training data correctly. This effect is independent of the underlying architecture and we show that it can be used to easily detect generated data like deepfakes with up to 100% accuracy on public benchmarks. To overcome this drawback of current generative models, we propose to add a novel spectral regularization term to the training optimization objective. We show that this approach not only allows to train spectral consistent GANs that are avoiding high frequency errors. Also, we show that a correct approximation of the frequency spectrum has positive effects on the training stability and output quality of generative networks.

Ricard Durall, Margret Keuper, Janis Keuper• 2020

Related benchmarks

Task	Dataset	Result
Synthetic Image Detection	GANs dataset	Mean ACC70.2	40
Synthetic Image Detection	Glide 50-27	Accuracy51.7	37
AI-generated image detection	Ojha Diffusion Benchmark 1.0 (test)	DALL-E Acc55.9	32
Synthetic Image Detection	Glide 100-10	Accuracy54.9	24
Synthetic Image Detection	Glide 100-27	Accuracy0.489	24
Model Attribution	GM-FFHQ (test)	Accuracy60.9	12
Model Attribution	GM-CelebA (test)	Accuracy62.2	12
Model Attribution	GM-CHQ (test)	Accuracy59.1	12
Model Attribution	GM-FFHQ to GM-CelebA-HQ	Accuracy42.6	12
Model Attribution	GM-CIFAR10 (test)	Accuracy57.293	12

Showing 10 of 37 rows

Other info

Follow for update

@wizwand_team Discord