Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection

About

The proliferation of deepfake faces poses huge potential negative impacts on our daily lives. Despite substantial advancements in deepfake detection over these years, the generalizability of existing methods against forgeries from unseen datasets or created by emerging generative models remains constrained. In this paper, inspired by the zero-shot advantages of Vision-Language Models (VLMs), we propose a novel approach that repurposes a well-trained VLM for general deepfake detection. Motivated by the model reprogramming paradigm that manipulates the model prediction via input perturbations, our method can reprogram a pre-trained VLM model (e.g., CLIP) solely based on manipulating its input without tuning the inner parameters. First, learnable visual perturbations are used to refine feature extraction for deepfake detection. Then, we exploit information of face embedding to create sample-level adaptative text prompts, improving the performance. Extensive experiments on several popular benchmark datasets demonstrate that (1) the cross-dataset and cross-manipulation performances of deepfake detection can be significantly and consistently improved (e.g., over 88\% AUC in cross-dataset setting from FF++ to WildDeepfake); (2) the superior performances are achieved with fewer trainable parameters, making it a promising approach for real-world applications.

Kaiqing Lin, Yuzhen Lin, Weixiang Li, Taiping Yao, Bin Li• 2024

Related benchmarks

TaskDatasetResultRank
Deepfake DetectionCelebDF v2
AUC0.899
57
Frame-level Deepfake DetectionDFD
AUC85.8
42
Video-level Deepfake DetectionDFDC
AUC0.81
34
AI-generated image detectionStyleGAN--
29
Frame-level Deepfake DetectionDFDC
AUC0.773
26
Video-level Deepfake DetectionDFD
AUC0.951
25
Frame-level Deepfake DetectionCeleb-DF v2
AUROC80
17
Frame-level Deepfake DetectionCeleb-DF v1
AUROC83
15
Deepfake DetectionDF40 and FFHQ unseen generators
Average Accuracy (ACC)88.04
14
Deepfake AttributionUnseen Advanced Generators VAE, HART, FLUX
VAE Accuracy60.34
14
Showing 10 of 22 rows

Other info

Follow for update