Generalizing Face Forgery Detection with High-frequency Features
About
Current face forgery detection methods achieve high accuracy under the within-database scenario, where training and testing forgeries are synthesized by the same algorithm. However, few of them achieve satisfactory performance under the cross-database scenario, where training and testing forgeries are synthesized by different algorithms. In this paper, we find that current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize. Observing that image noise removes color textures and exposes discrepancies between authentic and tampered regions, we propose to utilize high-frequency noise for face forgery detection. We carefully devise three functional modules to take full advantage of the high-frequency features. The first is the multi-scale high-frequency feature extraction module, which extracts high-frequency noise at multiple scales and composes a novel modality. The second is the residual-guided spatial attention module, which guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective. The last is the cross-modality attention module, which leverages the correlation between the two complementary modalities to promote feature learning for each other. Comprehensive evaluations on several benchmark databases corroborate the superior generalization performance of our proposed method.
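To make the idea of multi-scale high-frequency extraction more concrete, the following is a minimal sketch and not the paper's implementation. The abstract does not specify the filters or how the scales are formed, so everything here is an assumption: the 3x3 zero-sum kernel `HIGH_PASS`, the 2x2 average-pooling pyramid, and the helper names `high_pass`, `downsample2`, and `multi_scale_residuals` are all hypothetical. The sketch only illustrates that a high-pass residual suppresses flat color regions and keeps local discontinuities.

```python
import numpy as np

# Hypothetical 3x3 high-pass kernel (an assumption, not the paper's filter).
# Its coefficients sum to zero, so flat color regions map to zero.
HIGH_PASS = np.array(
    [[-1.0, -1.0, -1.0],
     [-1.0,  8.0, -1.0],
     [-1.0, -1.0, -1.0]]
) / 8.0


def high_pass(gray):
    """Filter a 2-D array with HIGH_PASS, using edge padding to keep the shape."""
    h, w = gray.shape
    padded = np.pad(gray, 1, mode="edge")
    out = np.zeros((h, w), dtype=np.float64)
    # Accumulate the nine shifted, weighted copies of the padded image.
    for dy in range(3):
        for dx in range(3):
            out += HIGH_PASS[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def downsample2(gray):
    """Halve the resolution by 2x2 average pooling (odd edges are cropped)."""
    h, w = gray.shape
    h2, w2 = (h // 2) * 2, (w // 2) * 2
    return gray[:h2, :w2].reshape(h2 // 2, 2, w2 // 2, 2).mean(axis=(1, 3))


def multi_scale_residuals(gray, num_scales=3):
    """Return one high-frequency residual map per scale, finest first."""
    residuals = []
    current = np.asarray(gray, dtype=np.float64)
    for _ in range(num_scales):
        residuals.append(high_pass(current))
        current = downsample2(current)
    return residuals


# Usage: a flat image yields all-zero residuals, while a step edge does not.
flat = np.full((16, 16), 0.5)
edge = np.zeros((16, 16))
edge[:, 8:] = 1.0
flat_res = multi_scale_residuals(flat)
edge_res = multi_scale_residuals(edge)
```

In this toy setting the residual maps respond only near the step edge, which mirrors the abstract's observation that noise residuals discard color texture while exposing local inconsistencies. A learned detector would consume such maps as an additional input modality alongside the RGB image.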
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Deepfake Detection | DFDC | AUC | 69.5 | 135 |
| Deepfake Detection | DFDC (test) | AUC | 79.7 | 87 |
| Deepfake Detection | DFD | AUC | 0.919 | 77 |
| Fake Face Detection | Celeb-DF v2 (test) | AUC | 79.4 | 50 |
| Face Forgery Detection | Celeb-DF | AUC | 69.3 | 46 |
| Deepfake Detection | CDFv1, CDFv2, DFD, DFDCP, DFDC (test) | DFD Score | 81.2 | 42 |
| Deepfake Detection | CelebDF v2 | AUC | 0.755 | 40 |
| Deepfake Detection | Cross-Domain Evaluation (test) | CDFv1 Score | 79.26 | 31 |
| Deepfake Detection | FaceForensics++ c23 (train) | FF c23 Score | 95.76 | 31 |
| Frame-level Deepfake Detection | DFD | AUC | 85.51 | 28 |