Venus-DeFakerOne: Unified Fake Image Detection & Localization

About

In recent years, the rapid evolution of generative AI has fundamentally reshaped image forgery, breaking down the traditional boundaries between document editing, natural image manipulation, DeepFake generation, and full-image AIGC synthesis. Despite this shift toward unified forgery generation, existing research in Fake Image Detection and Localization (FIDL) remains fragmented, which creates a mismatch between increasingly unified generation mechanisms and a domain-specific detection paradigm. Bridging this mismatch poses two key challenges for FIDL: understanding how artifacts transfer and interfere across domains, and building a high-capacity unified foundation model for joint detection and localization. To address these challenges, we propose DeFakerOne, a data-centric, unified FIDL foundation model that integrates InternVL2 and SAM2. DeFakerOne performs image-level detection and pixel-level forgery localization simultaneously across diverse scenarios. Extensive experiments show that DeFakerOne achieves state-of-the-art performance, outperforming baselines on 39 forgery detection benchmarks and 9 localization benchmarks. The model is also more robust against real-world perturbations and against state-of-the-art generators such as GPT-Image-2. Finally, we provide a systematic analysis of data scaling laws, cross-domain artifact transfer and interference patterns, the necessity of fine-grained supervision, and the preservation of artifacts at original resolution, highlighting design principles for scalable, robust, and unified FIDL.

GuangJian Team • 2026

Related benchmarks

Task	Dataset	Metric	Result	
Deepfake Detection	CelebDF v2	AUC	0.999	134
Forgery Localization	OpenMMSec	Accuracy	81.85	49
AIGI Detection	BFree Online	B.Acc	65.7	47
Synthetic Image Detection	Chameleon	Accuracy	84.7	36
Pixel-level Forgery Localization	DocTamperFCD, DocTamperSCD, DocTamperTest, T-SROIE, Tampered IC13, OSTF, RTM document-oriented (full)	Binary F1 Score	90.6	28
Image-level manipulation detection	DEFACTO 12k	AUC	78.3	26
Image-level Document Forgery Detection	DocTamper FCD	Accuracy	99.6	24
Pixel-level Forgery Localization	CASIAv1, COVERAGE, Columbia, NIST16, CocoGlide, AutoSplice nature-oriented (full)	Binary F1 Score	80.9	24
Deepfake Detection	FaceForensics++ c40 (test)	AUC	84.7	24
Image Deepfake Detection	WDF	AUC	0.93	23

Showing 10 of 43 rows
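
For readers unfamiliar with two of the metrics in the table, the sketch below shows the standard textbook definitions of pixel-level binary F1 (used for localization) and balanced accuracy (B.Acc, used for image-level detection). It is only an illustration: the function names are hypothetical, and the evaluation protocol actually used for DeFakerOne (mask thresholding, per-image versus pooled averaging, handling of empty masks) is not described here and is assumed.

```python
from typing import Sequence


def binary_f1(pred_mask: Sequence[Sequence[int]],
              true_mask: Sequence[Sequence[int]]) -> float:
    """Pixel-level binary F1 between two masks, where 1 marks a forged pixel."""
    tp = fp = fn = 0
    for pred_row, true_row in zip(pred_mask, true_mask):
        for p, t in zip(pred_row, true_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t and not p:
                fn += 1
    denom = 2 * tp + fp + fn
    # Assumption: two all-zero masks count as perfect agreement.
    return 2 * tp / denom if denom else 1.0


def balanced_accuracy(pred_labels: Sequence[int],
                      true_labels: Sequence[int]) -> float:
    """Mean of true-positive rate and true-negative rate (1 = fake, 0 = real)."""
    tp = tn = fp = fn = 0
    for p, t in zip(pred_labels, true_labels):
        if t and p:
            tp += 1
        elif t and not p:
            fn += 1
        elif not t and not p:
            tn += 1
        else:
            fp += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in true_labels")
    return 0.5 * (tp / (tp + fn) + tn / (tn + fp))


if __name__ == "__main__":
    # Toy 2x3 masks, not real benchmark data.
    pred = [[1, 1, 0], [0, 0, 0]]
    truth = [[1, 0, 0], [1, 0, 0]]
    print(binary_f1(pred, truth))
    print(balanced_accuracy([1, 1, 0, 0, 0], [1, 1, 1, 0, 0]))
```

Balanced accuracy averages the per-class recall, so it is less sensitive than plain accuracy to an imbalance between real and fake images in a test set.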
