Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection

About

AI-generated images (AIGIs), such as natural or face images, have become increasingly important yet challenging. In this paper, we start from a new perspective to excavate the reason behind the failure generalization in AIGI detection, named the \textit{asymmetry phenomenon}, where a naively trained detector tends to favor overfitting to the limited and monotonous fake patterns, causing the feature space to become highly constrained and low-ranked, which is proved seriously limiting the expressivity and generalization. One potential remedy is incorporating the pre-trained knowledge within the vision foundation models (higher-ranked) to expand the feature space, alleviating the model's overfitting to fake. To this end, we employ Singular Value Decomposition (SVD) to decompose the original feature space into \textit{two orthogonal subspaces}. By freezing the principal components and adapting only the remained components, we preserve the pre-trained knowledge while learning fake patterns. Compared to existing full-parameters and LoRA-based tuning methods, we explicitly ensure orthogonality, enabling the higher rank of the whole feature space, effectively minimizing overfitting and enhancing generalization. We finally identify a crucial insight: our method implicitly learns \textit{a vital prior that fakes are actually derived from the real}, indicating a hierarchical relationship rather than independence. Modeling this prior, we believe, is essential for achieving superior generalization. Our codes are publicly available at \href{https://github.com/YZY-stack/Effort-AIGI-Detection}{GitHub}.

Zhiyuan Yan, Jiangming Wang, Peng Jin, Ke-Yue Zhang, Chengchun Liu, Shen Chen, Taiping Yao, Shouhong Ding, Baoyuan Wu, Li Yuan• 2024

Related benchmarks

Task	Dataset	Result
Deepfake Detection	DFDC	AUC84.8	230
Deepfake Detection	DFD	AUC0.965	193
AI-generated image detection	GenImage	Midjourney Detection Rate82.4	154
Generated Image Detection	GenImage (test)	Average Accuracy91.1	135
Deepfake Detection	CelebDF v2	AUC0.956	134
Deepfake Detection	DFDC (test)	AUC86.3	130
AI-generated image detection	Chameleon	Accuracy89.98	127
Deepfake Detection	CDF v2	AUC0.956	97
Deepfake Detection	CDFv1, CDFv2, DFD, DFDCP, DFDC (test)	Overall Average Score91.8	74
Face Forgery Detection	DFDC	AUC82.5	74

Showing 10 of 303 rows

...

Other info

Follow for update

@wizwand_team Discord