UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection

About

In recent years, significant progress has been made in both image generation and generated image detection. Although both fields have developed rapidly, they have done so largely independently and have evolved distinct architectural paradigms: the former predominantly relies on generative networks, while the latter favors discriminative frameworks. A recent trend in both domains is the use of adversarial information to enhance performance, revealing potential for synergy. However, the significant architectural divergence between them presents considerable challenges. Departing from previous approaches, we propose UniGenDet: a Unified generative-discriminative framework for co-evolutionary image Generation and generated image Detection. To bridge the task gap, we design a symbiotic multimodal self-attention mechanism and a unified fine-tuning algorithm. This synergy allows the generation task to improve the interpretability of authenticity identification, while authenticity criteria guide the creation of higher-fidelity images. Furthermore, we introduce a detector-informed generative alignment mechanism to facilitate seamless information exchange. Extensive experiments on multiple datasets demonstrate that our method achieves state-of-the-art performance. Code: https://github.com/Zhangyr2022/UniGenDet

Yanran Zhang, Wenzhao Zheng, Yifei Li, Bingyao Yu, Yu Zheng, Lei Chen, Jiwen Lu, Jie Zhou • 2026

Related benchmarks

Task	Dataset	Result
Text-to-Image Generation	GenEval	--	914
Synthetic Image Detection	DMimage (Overall)	Accuracy 98.6	18
Synthetic Image Detection and Artifact Explanation	FakeClue	Accuracy 98	16
Synthetic Image Detection	DMimage (Real)	Accuracy 99	9
Synthetic Image Detection	DMimage (Fake)	Accuracy 97.2	9
Synthetic Image Detection	ARForensics	LlamaGen Accuracy 89.4	9
Text-to-Image Generation	LAION 5,000 prompts	FID 17.5	3
Text-to-Image Generation Diversity	LAION 500 prompts	CLIP Similarity 0.802	2

