General and Domain-Specific Zero-shot Detection of Generated Images via Conditional Likelihood
About
The rapid advancement of generative models, particularly diffusion-based methods, has significantly improved the realism of synthetic images. As new generative models continuously emerge, detecting generated images remains a critical challenge. While fully supervised, and few-shot methods have been proposed, maintaining an updated dataset is time-consuming and challenging. Consequently, zero-shot methods have gained increasing attention in recent years. We find that existing zero-shot methods often struggle to adapt to specific image domains, such as artistic images, limiting their real-world applicability. In this work, we introduce CLIDE, a novel zero-shot detection method based on conditional likelihood approximation. Our approach computes likelihoods conditioned on real images, enabling adaptation across diverse image domains. We extensively evaluate CLIDE, demonstrating SOTA performance on a large-scale general dataset and significantly outperform existing methods in domain-specific cases. These results demonstrate the robustness of our method and underscore the need of broad, domain-aware generalization for the AI-generated image detection task. Code is available at https://tinyurl.com/clide-detector.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| AI Image Detection | Midjourney | Accuracy85 | 27 | |
| Synthetic Image Detection | Glide 50-27 | Accuracy84 | 27 | |
| AI-generated image detection | BigGAN | mAP96 | 17 | |
| AI-generated image detection | CycleGAN | mAP98 | 17 | |
| Generated Image Detection | Wukong | AP96 | 17 | |
| Generated Image Detection | ADM | AP0.88 | 17 | |
| AI-generated image detection | GauGAN | mAP85 | 17 | |
| AI-generated image detection | ProGAN | mAP96 | 17 | |
| AI-generated image detection | StyleGAN | mAP0.71 | 17 | |
| Synthetic Image Detection | Glide 100-27 | Accuracy83 | 14 |