SpecSem-Net: Integrating Spectral and Semantic Features for Robust AI-generated Video Detection

About

The remarkable visual fidelity of recent commercial video generative models, such as Sora and Veo, makes robust AI-generated video detection increasingly essential, since synthetic content that is indistinguishable from real video can be exploited for disinformation. However, existing detectors often fail because they over-rely on semantic features, which are becoming increasingly realistic, and neglect subtle spectral artifacts. In this paper, we propose SpecSem-Net, the first framework to introduce a semantic-guided spectral denoising mechanism specifically for high-fidelity AI-generated video detection. Specifically, we design a spectral module that extracts high-frequency features via Fourier-transform-based filtering. Furthermore, to reduce misjudgments arising from spectral noise, we employ a Gated Merging Mechanism that adaptively fuses semantic context with the spectral features. Additionally, to evaluate detector performance on the latest top-tier generative models, we construct a comprehensive benchmark comprising five state-of-the-art (SOTA) commercial generators. Extensive experiments demonstrate that SpecSem-Net outperforms existing methods, achieving accuracies of 87.25% and 95.59% on our benchmark and public datasets, respectively.
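
As a rough illustration of the two ideas named in the abstract, the sketch below applies a Fourier-domain high-pass filter to a single grayscale frame and then blends a spectral feature vector with a semantic one through a sigmoid gate. It is not the authors' implementation: the function names `high_pass_fft` and `gated_merge`, the radial `cutoff` value, and the specific gate form (sigmoid of a linear map over the concatenated features) are all assumptions made for illustration, since the abstract does not give these details.

```python
import numpy as np


def high_pass_fft(frame: np.ndarray, cutoff: float = 0.25) -> np.ndarray:
    """Keep only spatial frequencies with normalized radius >= cutoff.

    `frame` is a 2-D grayscale array. `cutoff` is in cycles per pixel
    (0.5 is the Nyquist limit). Both are hypothetical choices.
    """
    h, w = frame.shape
    # Centre the zero-frequency (DC) component in the spectrum.
    spectrum = np.fft.fftshift(np.fft.fft2(frame))
    # Frequency coordinates laid out to match the shifted spectrum.
    fy = np.fft.fftshift(np.fft.fftfreq(h))[:, None]
    fx = np.fft.fftshift(np.fft.fftfreq(w))[None, :]
    radius = np.sqrt(fx**2 + fy**2)
    # Zero out everything below the cutoff radius (low frequencies).
    mask = radius >= cutoff
    filtered = np.fft.ifft2(np.fft.ifftshift(spectrum * mask))
    # The mask is symmetric, so the imaginary part is only rounding error.
    return filtered.real


def gated_merge(spectral: np.ndarray, semantic: np.ndarray,
                w_gate: np.ndarray, b_gate: np.ndarray) -> np.ndarray:
    """Blend two feature vectors with a learned per-dimension gate.

    Assumed form: gate = sigmoid([spectral; semantic] @ w_gate + b_gate),
    output = gate * spectral + (1 - gate) * semantic.
    """
    joint = np.concatenate([spectral, semantic], axis=-1)
    gate = 1.0 / (1.0 + np.exp(-(joint @ w_gate + b_gate)))
    return gate * spectral + (1.0 - gate) * semantic


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frame = rng.random((64, 64))
    residual = high_pass_fft(frame)
    print("high-frequency residual shape:", residual.shape)

    d = 4
    fused = gated_merge(rng.random(d), rng.random(d),
                        np.zeros((2 * d, d)), np.zeros(d))
    print("fused feature shape:", fused.shape)
```

In this sketch a gate value near 0 lets the semantic features dominate where the spectral signal is unreliable, which is one plausible reading of "semantic-guided spectral denoising"; the actual mechanism in SpecSem-Net may differ.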

Zixi Wei, Huixuaun Zhang, Xiaojun Wan • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
| AI-generated Video Detection | GenVideo Crafter | ACC 97.32 | 13 |
| AI-generated Video Detection | Wan Frontier Commercial Generators | Accuracy 85.55 | 7 |
| AI-generated Video Detection | Kling Frontier Commercial Generators | Accuracy 92.05 | 7 |
| AI-generated Video Detection | Veo Frontier Commercial Generators | Accuracy 93.55 | 7 |
| AI-generated Video Detection | Sora Frontier Commercial Generators | Accuracy 79.3 | 7 |
| AI-generated Video Detection | Hailuo Frontier Commercial Generators | Accuracy 91.55 | 7 |
| AI-generated Video Detection | Mean Across Frontier Commercial Generators | Accuracy 87.25 | 7 |
| AI-generated Video Detection | I2VGEN | Accuracy 97.4 | 7 |
| AI-generated Video Detection | OPEN SORA | Accuracy 94.08 | 7 |
| AI-generated Video Detection | Pika | Accuracy 96.96 | 7 |
Showing 10 of 13 rows
