AI-Generated Video Detection via Spatio-Temporal Anomaly Learning
About
The advancement of generation models has led to the emergence of highly realistic artificial intelligence (AI)-generated videos. Malicious users can easily create non-existent videos to spread false information. This letter proposes an effective AI-generated video detection (AIGVDet) scheme by capturing the forensic traces with a two-branch spatio-temporal convolutional neural network (CNN). Specifically, two ResNet sub-detectors are learned separately for identifying the anomalies in spatical and optical flow domains, respectively. Results of such sub-detectors are fused to further enhance the discrimination ability. A large-scale generated video dataset (GVD) is constructed as a benchmark for model training and evaluation. Extensive experimental results verify the high generalization and robustness of our AIGVDet scheme. Code and dataset will be available at https://github.com/multimediaFor/AIGVDet.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| AI-generated Video Detection | ViF-Bench T2V 1.0 (test) | Accuracy (Acc)69.08 | 13 | |
| AI-generated Video Detection | ViF-Bench I2V 1.0 (test) | Accuracy69.08 | 7 | |
| AI-generated Video Detection | GenVideo ModelScope | Accuracy50.36 | 6 | |
| AI-generated Video Detection | GenVideo Morph Studio | Accuracy50.21 | 6 | |
| AI-generated Video Detection | GenVideo Moon Valley | Accuracy50 | 6 | |
| AI-generated Video Detection | GenVideo Crafter | ACC50.18 | 6 | |
| AI-generated Video Detection | GenVideo Sora | Accuracy50 | 6 | |
| AI-generated Video Detection | GenVideo Wild Scrape | Accuracy50 | 6 | |
| AI-generated Video Detection | GenVideo Average | Accuracy50.09 | 6 | |
| AI-generated Video Detection | GenVideo Hot Shot | Accuracy50.07 | 6 |