HEDGE: Heterogeneous Ensemble for Detection of AI-GEnerated Images in the Wild
About
Robust detection of AI-generated images in the wild remains challenging due to the rapid evolution of generative models and varied real-world distortions. We argue that relying on a single training regime, resolution, or backbone is insufficient to handle all conditions, and that structured heterogeneity across these dimensions is essential for robust detection. To this end, we propose HEDGE, a Heterogeneous Ensemble for Detection of AI-GEnerated images, that introduces complementary detection routes along three axes: diverse training data with strong augmentation, multi-scale feature extraction, and backbone heterogeneity. Specifically, Route~A progressively constructs DINOv3-based detectors through staged data expansion and augmentation escalation, Route~B incorporates a higher-resolution branch for fine-grained forensic cues, and Route~C adds a MetaCLIP2-based branch for backbone diversity. All outputs are fused via logit-space weighted averaging, refined by a lightweight dual-gating mechanism that handles branch-level outliers and majority-dominated fusion errors. HEDGE achieves 4th place in the NTIRE 2026 Robust AI-Generated Image Detection in the Wild Challenge and attains state-of-the-art performance with strong robustness on multiple AIGC image detection benchmarks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| AI-generated image detection | GenImage | -- | 106 | |
| AIGI Detection | SynthWildx | DALLE3 Performance Score97.9 | 35 | |
| AIGI Detection | BFree Online | B.Acc82.1 | 35 | |
| AIGI Detection | DRCT-2M | B.Acc95 | 35 | |
| AIGI Detection | AIGCDetect | B.Acc99.5 | 24 | |
| AIGI Detection | RRDataset | B.Acc99.9 | 24 | |
| AIGI Detection | Synthbuster | B.Acc97.3 | 24 | |
| AI-generated image detection | WildRF | FB Score99.3 | 23 | |
| AI-generated image detection | RealChain Chain Degradations | R.Acc98.7 | 21 | |
| AI-generated image detection | Chameleon | B.Acc99.9 | 12 |