Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Universal Anti-forensics Attack against Image Forgery Detection via Multi-modal Guidance

About

The rapid advancement of AI-Generated Content (AIGC) technologies poses significant challenges for authenticity assessment. However, existing evaluation protocols largely overlook anti-forensics attack, failing to ensure the comprehensive robustness of state-of-the-art AIGC detectors in real-world applications. To bridge this gap, we propose ForgeryEraser, a framework designed to execute universal anti-forensics attack without access to the target AIGC detectors. We reveal an adversarial vulnerability stemming from the systemic reliance on Vision-Language Models (VLMs) as shared backbones (e.g., CLIP), where downstream AIGC detectors inherit the feature space of these publicly accessible models. Instead of traditional logit-based optimization, we design a multi-modal guidance loss to drive forged image embeddings within the VLM feature space toward text-derived authentic anchors to erase forgery traces, while repelling them from forgery anchors. Extensive experiments demonstrate that ForgeryEraser causes substantial performance degradation to advanced AIGC detectors on both global synthesis and local editing benchmarks. Moreover, ForgeryEraser induces explainable forensic models to generate explanations consistent with authentic images for forged images. Our code will be made publicly available.

Haipeng Li, Rongxuan Peng, Anwei Luo, Shunquan Tan, Changsheng Chen, Anastasia Antsiferova• 2026

Related benchmarks

TaskDatasetResultRank
Local Editing DetectionReal Images Protocol-1
Accuracy97.9
6
Local Editing DetectionFake Images Protocol-1
Accuracy9.4
6
Global Synthesis DetectionSID-Set FullSync Real Images
Accuracy99.4
3
Global Synthesis DetectionAIGCDetectBenchmark Real Images
Accuracy99.4
3
Global Synthesis DetectionFakeClue Real Images
Accuracy97.5
3
Global Synthesis DetectionUniversalFakeDetect Real Images
Accuracy (%)99.9
3
Local Editing DetectionSID-Set Tampered Real Images
Accuracy99.4
3
Global Synthesis DetectionSID-Set FullSync Fake Images
Accuracy47
3
Global Synthesis DetectionAIGCDetectBenchmark Fake Images
Accuracy31.6
3
Global Synthesis DetectionFakeClue Fake Images
Accuracy55.6
3
Showing 10 of 12 rows

Other info

Follow for update