D3: Training-Free AI-Generated Video Detection Using Second-Order Features
About
The evolution of video generation techniques, such as Sora, has made it increasingly easy to produce high-fidelity AI-generated videos, raising public concern over the dissemination of synthetic content. However, existing detection methodologies remain limited by their insufficient exploration of temporal artifacts in synthetic videos. To bridge this gap, we establish a theoretical framework through second-order dynamical analysis under Newtonian mechanics, subsequently extending the Second-order Central Difference features tailored for temporal artifact detection. Building on this theoretical foundation, we reveal a fundamental divergence in second-order feature distributions between real and AI-generated videos. Concretely, we propose Detection by Difference of Differences (D3), a novel training-free detection method that leverages the above second-order temporal discrepancies. We validate the superiority of our D3 on 4 open-source datasets (Gen-Video, VideoPhy, EvalCrafter, VidProM), 40 subsets in total. For example, on GenVideo, D3 outperforms the previous best method by 10.39% (absolute) mean Average Precision. Additional experiments on time cost and post-processing operations demonstrate D3's exceptional computational efficiency and strong robust performance. Our code is available at https://github.com/Zig-HS/D3.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Deepfake Detection | DFDC | AUC68.29 | 230 | |
| Deepfake Detection | DFD | AUC0.7912 | 193 | |
| Deepfake Detection | CDF v2 | AUC0.7455 | 97 | |
| AI-generated Video Detection | EA-Video seen (evaluation) | Accuracy51.2 | 88 | |
| Deepfake Detection | FaceForensics++ (test) | AUC96.24 | 65 | |
| AI-generated Video Detection | VideoPhy 1.0 (test) | CVX Score90.23 | 42 | |
| AI-generated Video Detection | VidProm | AUC (MS)87.61 | 42 | |
| AI-generated Video Detection | EvalCrafter | Floor33 Score94.09 | 42 | |
| Synthetic Video Detection | GenVideo (test) | Average Detection Rate92.2 | 34 | |
| AI-generated Video Detection | VideoPhy | CVX AUC90.23 | 28 |