Detecting AI-Generated Video via Frame Consistency
About
The escalating quality of video generated by advanced video generation methods results in new security challenges, while there have been few relevant research efforts: 1) There is no open-source dataset for generated video detection, 2) No generated video detection method has been proposed so far. To this end, we propose an open-source dataset and a detection method for generated video for the first time. First, we propose a scalable dataset consisting of 964 prompts, covering various forgery targets, scenes, behaviors, and actions, as well as various generation models with different architectures and generation methods, including the most popular commercial models like OpenAI's Sora and Google's Veo. Second, we found via probing experiments that spatial artifact-based detectors lack generalizability. Hence, we propose a simple yet effective \textbf{de}tection model based on \textbf{f}rame \textbf{co}nsistency (\textbf{DeCoF}), which focuses on temporal artifacts by eliminating the impact of spatial artifacts during feature learning. Extensive experiments demonstrate the efficacy of DeCoF in detecting videos generated by unseen video generation models and confirm its powerful generalizability across several commercially proprietary models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| AI-generated Video Detection | VidProm | AUC (MS)89.18 | 42 | |
| AI-generated Video Detection | EvalCrafter | Floor33 Score96.87 | 42 | |
| AI-generated Video Detection | VideoPhy 1.0 (test) | CVX Score53.29 | 42 | |
| Synthetic Video Detection | GenVideo (test) | Average Detection Rate92.43 | 34 | |
| AI-generated Video Detection | EvalCrafter 14 subsets (test) | Floor33 Score96.87 | 28 | |
| AI-generated Video Detection | VideoPhy | CVX AUC53.29 | 28 | |
| AI-generated Video Detection | GenVideo (test) | Mean Score81.67 | 23 | |
| Video Detection | GenVideo | ACC87.6 | 14 | |
| Video Detection | EvalCrafter | ACC89.75 | 14 | |
| Video Detection | VidProm | Accuracy85.47 | 14 |