Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Where Detectors Fail: Probing Generative Space for Generalizable AI-Generated Image Detection

About

Detecting AI-generated images (AIGI) remains challenging because detectors often fail to generalize to unseen generators. Although existing methods are trained on large datasets, their performance still degrades when generation settings change, indicating that data scale alone is insufficient and that limited coverage of generative variations during training is a key factor. Studies on generative model editing show that small changes in internal representations can produce diverse and meaningful image variations, many of which are not explored under standard sampling. Leveraging this insight, we propose PROBE (Probing Robustness via Boundary Exploration), a framework that improves detector generalization by actively exploring challenging regions of the generative process. Instead of treating the generator as a fixed data source, PROBE uses the detector as a critic to steer the generator through manifold-level modifications, producing realistic samples that are difficult to classify. These samples expose failure cases that are uncommon under standard data sampling strategies and are used to refine the detector. Experimental results across multiple benchmarks indicate that PROBE enhances generalization to unseen generators, resulting in more generalizable AIGI detection performance. Code and models are available at https://github.com/Amamiya-C/PROBE-AIGI-Detection

Zijie Cao, Weijie Tu, Yao Xiao, Weijian Deng, Liang Lin, Pengxu Wei• 2026

Related benchmarks

TaskDatasetResultRank
AI-generated image detectionGenImage--
154
AI-generated image detectionWildRF--
36
AI-generated image detectionChameleon
B.Acc86.6
35
AIGC DetectionSynthWildx
Balanced Accuracy96.4
23
Synthetic Image DetectionSynthbuster
Balanced Accuracy97.5
23
AI-generated image detectionAIGI-Quality-Paradox
AP97.7
12
AI-generated image detectionChameleon
AP92.9
12
AI-generated image detectionSynthWildx
AP98.7
12
AI-generated image detectionAIGI-Bench
AP96.5
12
AI-generated image detectionAIGI-Bench
Balanced Accuracy (bAcc)89.9
12
Showing 10 of 11 rows

Other info

Follow for update