Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Venus-DeFakerOne: Unified Fake Image Detection & Localization

About

In recent years, the rapid evolution of generative AI has fundamentally reshaped the paradigm of image forgery, breaking the traditional boundaries between document editing, natural image manipulation, DeepFake generation, and full-image AIGC synthesis. Despite this shift toward unified forgery generation, existing research in Fake Image Detection and Localization (FIDL) remains fragmented. This creates a mismatch between increasingly unified forgery generation mechanisms and the domain-specific detection paradigm. Bridging this mismatch poses two key challenges for FIDL: understanding cross-domain artifacts transfer and interference, and building a high-capacity unified foundation model for joint detection and localization. To address these challenges, we propose DeFakerOne, a data-centric, unified FIDL foundation model integrating InternVL2 and SAM2. DeFakerOne enables simultaneous image-level detection and pixel-level forgery localization across diverse scenarios. Extensive experiments demonstrate that DeFakerOne achieves state-of-the-art performance, outperforming baselines on 39 forgery detection benchmarks and 9 localization benchmarks. Furthermore, the model exhibits superior robustness against real-world perturbations and state-of-the-art generators such as GPT-Image-2. Finally, we provide a systematic analysis of data scaling laws, cross-domain artifacts transfer-interference patterns, the necessity of fine-grained supervision, and the original resolution artifacts preservation, highlighting the design principles for scalable, robust, and unified FIDL.

GuangJian Team• 2026

Related benchmarks

TaskDatasetResultRank
Deepfake DetectionCelebDF v2
AUC0.999
134
Forgery LocalizationOpenMMSec
Accuracy81.85
49
AIGI DetectionBFree Online
B.Acc65.7
47
Pixel-level Forgery LocalizationDocTamperFCD, DocTamperSCD, DocTamperTest, T-SROIE, Tampered IC13, OSTF, RTM document-oriented (full)
Binary F1 Score90.6
28
Image-level manipulation detectionDEFACTO 12k
AUC78.3
26
Image-level Document Forgery DetectionDocTamper FCD
Accuracy99.6
24
Pixel-level Forgery LocalizationCASIAv1, COVERAGE, Columbia, NIST16, CocoGlide, AutoSplice nature-oriented (full)
Binary F1 Score80.9
24
Deepfake DetectionFaceForensics++ c40 (test)
AUC84.7
24
Image Deepfake DetectionWDF
AUC0.93
23
Synthetic Image DetectionChameleon
Accuracy84.7
23
Showing 10 of 43 rows

Other info

Follow for update