How Noise Benefits AI-generated Image Detection
About
The rapid advancement of generative models has made real and synthetic images increasingly indistinguishable. Although extensive efforts have been devoted to detecting AI-generated images, out-of-distribution generalization remains a persistent challenge. We trace this weakness to spurious shortcuts exploited during training and we also observe that small feature-space perturbations can mitigate shortcut dominance. To address this problem in a more controllable manner, we propose the Positive-Incentive Noise for CLIP (PiN-CLIP), which jointly trains a noise generator and a detection network under a variational positive-incentive principle. Specifically, we construct positive-incentive noise in the feature space via cross-attention fusion of visual and categorical semantic features. During optimization, the noise is injected into the feature space to fine-tune the visual encoder, suppressing shortcut-sensitive directions while amplifying stable forensic cues, thereby enabling the extraction of more robust and generalized artifact representations. Comparative experiments are conducted on an open-world dataset comprising synthetic images generated by 42 distinct generative models. Our method achieves new state-of-the-art performance, with notable improvements of 5.4 in average accuracy over existing approaches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| AI-generated image detection | Chameleon | Accuracy92.4 | 107 | |
| AIGI Detection | SynthWildx | DALLE3 Performance Score97 | 35 | |
| AI-generated image detection | WildRF | FB Score96.9 | 23 | |
| AI-generated image detection | Real-world Datasets Chameleon, SynthWildX, WildRF Aggregate | Accuracy95.8 | 11 | |
| AI-generated image detection | AIGCDetect 50 (test) | ProGAN Accuracy95.6 | 11 | |
| AI-generated image detection | AIGCDetect (test) | ProGAN Detection Rate99.9 | 11 | |
| AIGC Detection | Chameleon + SynthWildX + WildRF Average real-world (test) | Accuracy (JPEG QF=95)94.8 | 8 | |
| AIGC Detection | AIGCDetect (test) | Accuracy (JPEG QF=95)95.2 | 8 |