Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Towards Universal Fake Image Detectors that Generalize Across Generative Models

About

With generative models proliferating at a rapid rate, there is a growing need for general purpose fake image detectors. In this work, we first show that the existing paradigm, which consists of training a deep network for real-vs-fake classification, fails to detect fake images from newer breeds of generative models when trained to detect GAN fake images. Upon analysis, we find that the resulting classifier is asymmetrically tuned to detect patterns that make an image fake. The real class becomes a sink class holding anything that is not fake, including generated images from models not accessible during training. Building upon this discovery, we propose to perform real-vs-fake classification without learning; i.e., using a feature space not explicitly trained to distinguish real from fake images. We use nearest neighbor and linear probing as instantiations of this idea. When given access to the feature space of a large pretrained vision-language model, the very simple baseline of nearest neighbor classification has surprisingly good generalization ability in detecting fake images from a wide variety of generative models; e.g., it improves upon the SoTA by +15.07 mAP and +25.90% acc when tested on unseen diffusion and autoregressive models.

Utkarsh Ojha, Yuheng Li, Yong Jae Lee• 2023

Related benchmarks

TaskDatasetResultRank
Generated Image DetectionGenImage (test)
Average Accuracy88.8
103
Artifact DetectionOpenMMSec
Deepfake EFS88.5
68
AI-generated image detectionGenImage
Midjourney Detection Rate93.9
65
AI-generated image detectionChameleon
Accuracy59.11
63
AI-generated image detectionChameleon (test)
Accuracy57.22
54
Deepfake DetectionUniversalFakeDetect 1.0 (test)
Accuracy (ProGAN)100
42
Synthetic Image DetectionForenSynths (test)
Mean Accuracy89.4
31
Synthetic Image DetectionGlide 50-27
Accuracy91.1
27
AI Image DetectionMidjourney
Accuracy56.1
27
Forgery ClassificationAverage across 7 Sets (CASIA2, NIST16, DEFACTO, ForenSynths, DiffusionDB, Midjourney, RAISE) (test)
Accuracy80.6
25
Showing 10 of 282 rows
...

Other info

Follow for update