Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection
About
AI-generated images (AIGIs), such as natural or face images, have become increasingly important yet challenging. In this paper, we start from a new perspective to excavate the reason behind the failure generalization in AIGI detection, named the \textit{asymmetry phenomenon}, where a naively trained detector tends to favor overfitting to the limited and monotonous fake patterns, causing the feature space to become highly constrained and low-ranked, which is proved seriously limiting the expressivity and generalization. One potential remedy is incorporating the pre-trained knowledge within the vision foundation models (higher-ranked) to expand the feature space, alleviating the model's overfitting to fake. To this end, we employ Singular Value Decomposition (SVD) to decompose the original feature space into \textit{two orthogonal subspaces}. By freezing the principal components and adapting only the remained components, we preserve the pre-trained knowledge while learning fake patterns. Compared to existing full-parameters and LoRA-based tuning methods, we explicitly ensure orthogonality, enabling the higher rank of the whole feature space, effectively minimizing overfitting and enhancing generalization. We finally identify a crucial insight: our method implicitly learns \textit{a vital prior that fakes are actually derived from the real}, indicating a hierarchical relationship rather than independence. Modeling this prior, we believe, is essential for achieving superior generalization. Our codes are publicly available at \href{https://github.com/YZY-stack/Effort-AIGI-Detection}{GitHub}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Generated Image Detection | GenImage (test) | Average Accuracy91.1 | 103 | |
| Artifact Detection | OpenMMSec | Deepfake EFS94.7 | 68 | |
| AI-generated image detection | GenImage | Midjourney Detection Rate82.4 | 65 | |
| Deepfake Detection | CDFv1, CDFv2, DFD, DFDCP, DFDC (test) | DFD Score95.721 | 42 | |
| Deepfake Detection | DFDCP (test) | AUC92.89 | 27 | |
| Video Deepfake Detection | DF-TIMIT (test) | AUC94.96 | 27 | |
| Deepfake Detection | FaceForensics++ c23 (test) | AUC94.52 | 26 | |
| Face Forgery Detection | DFDC | AUC82.5 | 25 | |
| Face Forgery Detection | S2CFP (test) | Score (@ijustine)85.42 | 24 | |
| AI-generated image detection | GenImage 1.0 (test) | Midjourney Detection Rate82.4 | 24 |