Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Rethinking Cross-Generator Image Forgery Detection through DINOv3

About

As generative models become increasingly diverse and powerful, cross-generator detection has emerged as a new challenge. Existing detection methods often memorize artifacts of specific generative models rather than learning transferable cues, leading to substantial failures on unseen generators. Surprisingly, this work finds that frozen visual foundation models, especially DINOv3, already exhibit strong cross-generator detection capability without any fine-tuning. Through systematic studies on frequency, spatial, and token perspectives, we observe that DINOv3 tends to rely on global, low-frequency structures as weak but transferable authenticity cues instead of high-frequency, generator-specific artifacts. Motivated by this insight, we introduce a simple, training-free token-ranking strategy followed by a lightweight linear probe to select a small subset of authenticity-relevant tokens. This token subset consistently improves detection accuracy across all evaluated datasets. Our study provides empirical evidence and a feasible hypothesis for understanding why foundation models generalize across diverse generators, offering a universal, efficient, and interpretable baseline for image forgery detection.

Zhenglin Huang, Jason Li, Haiquan Wen, Tianxiao Li, Xi Yang, Lu Qi, Bei Peng, Xiaowei Huang, Ming-Hsuan Yang, Guangliang Cheng• 2025

Related benchmarks

TaskDatasetResultRank
AI-generated image detectionGenImage
Midjourney Detection Rate85.7
65
Image Forgery DetectionAIGCDetectionBenchmark (test)
Detection Score (ProGAN)99.59
13
Forgery DetectionSo-Fake OOD (test)
Accuracy87.5
9
AI-generated image detectionSo-Fake OOD
Flux.1 pro Accuracy79.9
8
Showing 4 of 4 rows

Other info

Follow for update