Exposing and Mitigating Temporal Attack in Deepfake Video Detection

About

While spatiotemporal deepfake detectors achieve high AUC, our experiments reveal their susceptibility to evasion attacks. These models tend to overfit to fragile temporal spectrum cues rather than learning robust semantic causality. To mitigate this vulnerability, we propose SpInShield, a temporal spectral-invariant defense framework explicitly designed to decouple semantic motion from manipulable spectral artifacts. The framework includes a learnable spectral adversary that dynamically synthesizes severe spectral deformations, simulating extreme attack scenarios. By employing a shortcut suppression optimization strategy, SpInShield compels the encoder to extract reliable forensic cues while purging unstable spectral statistics from the latent space. Experiments show that SpInShield obtains competitive performance on widely used datasets and outperforms the strongest baseline by 21.30 percentage points in AUC under simulated amplitude spectral attacks.
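
To make the notion of an amplitude spectral attack along the temporal axis more concrete, here is a minimal sketch in Python with NumPy. It is not SpInShield's learnable adversary: it only applies a fixed random gain to each temporal frequency while keeping the phase, which is one plausible reading of "amplitude spectral deformation". The function name `perturb_temporal_amplitude`, the `strength` parameter, and the (T, H, W, C) clip layout are assumptions made for illustration.

```python
import numpy as np

def perturb_temporal_amplitude(clip, strength=0.5, seed=0):
    """Rescale the temporal amplitude spectrum of a clip, keeping its phase.

    clip: float array of shape (T, H, W, C) (assumed layout).
    strength: maximum relative change per temporal frequency (hypothetical knob).
    """
    rng = np.random.default_rng(seed)
    # Real FFT over the time axis: one spectrum per pixel and channel.
    spectrum = np.fft.rfft(clip, axis=0)
    amplitude = np.abs(spectrum)
    phase = np.angle(spectrum)
    # One random gain per temporal frequency, shared across all pixels.
    gains = 1.0 + strength * rng.uniform(-1.0, 1.0, size=amplitude.shape[0])
    gains[0] = 1.0  # leave the DC term (per-pixel mean over time) untouched
    gains = gains.reshape(-1, 1, 1, 1)
    attacked = (amplitude * gains) * np.exp(1j * phase)
    # Back to the time domain with the original number of frames.
    return np.fft.irfft(attacked, n=clip.shape[0], axis=0)

# Usage on a random stand-in clip of 8 frames at 4x4 resolution.
clip = np.random.default_rng(1).random((8, 4, 4, 3))
attacked_clip = perturb_temporal_amplitude(clip, strength=0.5)
```

Because only the temporal amplitudes change and the DC term is preserved, each pixel keeps its average value over time while its motion statistics shift. A detector that relies on those statistics could in principle be misled by such a change, which is the failure mode the abstract describes.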

Zheyuan Gu, Minghao Shao, Zhen Wang, Yusong Wang, Mingkun Xu, Shijie Zhang, Hao Jiang • 2026

Related benchmarks

Task	Dataset	Result
Deepfake Detection	DFD	AUC 0.981	193
Deepfake Detection	CelebDF v2	AUC 0.948	134
Deepfake Detection	CDF v2	AUC 0.9052	97
Deepfake Detection	FaceForensics++ (test)	AUC 89.92	65
Image Deepfake Detection	DFo	AUC 0.9481	62
Deepfake Detection	WDF	AUC 0.889	54
Deepfake Detection	FaceForensics++ c23 (test)	AUC 99.6	52
Deepfake Detection	WildDeepfake (WDF)	Video-level AUC 0.6183	31
Deepfake Detection	DiF	AUC 0.8528	22
Deepfake Detection	DaG	AUC 83.92	22

Showing 10 of 22 rows
