Universal Anti-forensics Attack against Image Forgery Detection via Multi-modal Guidance
About
The rapid advancement of AI-Generated Content (AIGC) technologies poses significant challenges for authenticity assessment. However, existing evaluation protocols largely overlook anti-forensics attack, failing to ensure the comprehensive robustness of state-of-the-art AIGC detectors in real-world applications. To bridge this gap, we propose ForgeryEraser, a framework designed to execute universal anti-forensics attack without access to the target AIGC detectors. We reveal an adversarial vulnerability stemming from the systemic reliance on Vision-Language Models (VLMs) as shared backbones (e.g., CLIP), where downstream AIGC detectors inherit the feature space of these publicly accessible models. Instead of traditional logit-based optimization, we design a multi-modal guidance loss to drive forged image embeddings within the VLM feature space toward text-derived authentic anchors to erase forgery traces, while repelling them from forgery anchors. Extensive experiments demonstrate that ForgeryEraser causes substantial performance degradation to advanced AIGC detectors on both global synthesis and local editing benchmarks. Moreover, ForgeryEraser induces explainable forensic models to generate explanations consistent with authentic images for forged images. Our code will be made publicly available.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Local Editing Detection | Real Images Protocol-1 | Accuracy97.9 | 6 | |
| Local Editing Detection | Fake Images Protocol-1 | Accuracy9.4 | 6 | |
| Global Synthesis Detection | SID-Set FullSync Real Images | Accuracy99.4 | 3 | |
| Global Synthesis Detection | AIGCDetectBenchmark Real Images | Accuracy99.4 | 3 | |
| Global Synthesis Detection | FakeClue Real Images | Accuracy97.5 | 3 | |
| Global Synthesis Detection | UniversalFakeDetect Real Images | Accuracy (%)99.9 | 3 | |
| Local Editing Detection | SID-Set Tampered Real Images | Accuracy99.4 | 3 | |
| Global Synthesis Detection | SID-Set FullSync Fake Images | Accuracy47 | 3 | |
| Global Synthesis Detection | AIGCDetectBenchmark Fake Images | Accuracy31.6 | 3 | |
| Global Synthesis Detection | FakeClue Fake Images | Accuracy55.6 | 3 |