Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection
About
AI-generated images (AIGIs), such as natural or face images, have become increasingly important yet challenging. In this paper, we start from a new perspective to excavate the reason behind the failure generalization in AIGI detection, named the \textit{asymmetry phenomenon}, where a naively trained detector tends to favor overfitting to the limited and monotonous fake patterns, causing the feature space to become highly constrained and low-ranked, which is proved seriously limiting the expressivity and generalization. One potential remedy is incorporating the pre-trained knowledge within the vision foundation models (higher-ranked) to expand the feature space, alleviating the model's overfitting to fake. To this end, we employ Singular Value Decomposition (SVD) to decompose the original feature space into \textit{two orthogonal subspaces}. By freezing the principal components and adapting only the remained components, we preserve the pre-trained knowledge while learning fake patterns. Compared to existing full-parameters and LoRA-based tuning methods, we explicitly ensure orthogonality, enabling the higher rank of the whole feature space, effectively minimizing overfitting and enhancing generalization. We finally identify a crucial insight: our method implicitly learns \textit{a vital prior that fakes are actually derived from the real}, indicating a hierarchical relationship rather than independence. Modeling this prior, we believe, is essential for achieving superior generalization. Our codes are publicly available at \href{https://github.com/YZY-stack/Effort-AIGI-Detection}{GitHub}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Deepfake Detection | DFDC | AUC77.65 | 150 | |
| Generated Image Detection | GenImage (test) | Average Accuracy91.1 | 124 | |
| Deepfake Detection | DFDC (test) | AUC86.3 | 122 | |
| AI-generated image detection | Chameleon | Accuracy57 | 107 | |
| AI-generated image detection | GenImage | Midjourney Detection Rate82.4 | 106 | |
| Deepfake Detection | DFD | AUC0.8698 | 91 | |
| Deepfake Detection | CDFv1, CDFv2, DFD, DFDCP, DFDC (test) | Overall Average Score91.8 | 74 | |
| Artifact Detection | OpenMMSec | Deepfake EFS94.7 | 68 | |
| Face Forgery Detection | DFDCP | Frame-level AUC86 | 64 | |
| Deepfake Detection | CelebDF v2 | AUC0.956 | 57 |