AI-Generated Video Detection via Spatio-Temporal Anomaly Learning
About
The advancement of generation models has led to the emergence of highly realistic artificial intelligence (AI)-generated videos. Malicious users can easily create non-existent videos to spread false information. This letter proposes an effective AI-generated video detection (AIGVDet) scheme by capturing the forensic traces with a two-branch spatio-temporal convolutional neural network (CNN). Specifically, two ResNet sub-detectors are learned separately for identifying the anomalies in spatical and optical flow domains, respectively. Results of such sub-detectors are fused to further enhance the discrimination ability. A large-scale generated video dataset (GVD) is constructed as a benchmark for model training and evaluation. Extensive experimental results verify the high generalization and robustness of our AIGVDet scheme. Code and dataset will be available at https://github.com/multimediaFor/AIGVDet.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Synthetic Video Detection | GenVideo (test) | Average Detection Rate70.14 | 34 | |
| AI-generated Video Detection | VideoPhy 1.0 (test) | CVX Score78.13 | 28 | |
| AI-generated Video Detection | EvalCrafter | Floor33 Score79.52 | 28 | |
| AI-generated Video Detection | VideoPhy | CVX AUC78.13 | 14 | |
| AI-generated Video Detection | GenVideo | MS Score70.91 | 14 | |
| AI-generated Video Detection | VidProm | AUC (MS)60.11 | 14 | |
| AI-generated Video Detection | VidProM (test) | MS Performance63.33 | 14 | |
| Video Detection | VideoPhy | Accuracy53.33 | 14 | |
| Video Detection | VidProm | Accuracy47.25 | 14 | |
| Video Detection | GenVideo | ACC49.07 | 14 |