Video Signature: Implicit Watermarking for Video Diffusion Models

About

The rapid development of Artificial Intelligence Generated Content (AIGC) has led to significant progress in video generation, but it also raises serious concerns about intellectual property protection and reliable content tracing. Watermarking is a widely adopted solution to this issue, yet existing methods for video generation mainly follow a post-generation paradigm, which often fails to balance video quality against watermark extraction accuracy. Meanwhile, current in-generation methods that embed the watermark into the initial Gaussian noise usually incur substantial additional computation. To address these issues, we propose Video Signature (VidSig), an implicit watermarking method for video diffusion models that enables imperceptible and adaptive watermark integration during video generation with almost no extra latency. Specifically, we partially fine-tune the latent decoder, where Perturbation-Aware Suppression (PAS) pre-identifies and freezes perceptually sensitive layers to preserve visual quality. Beyond spatial fidelity, we further enhance temporal consistency by introducing a lightweight Temporal Alignment module that guides the decoder to generate coherent frame sequences during fine-tuning. Experimental results show that VidSig achieves the best trade-off among watermark extraction accuracy, video quality, and watermarking latency. It also demonstrates strong robustness against both spatial and temporal tampering, and remains stable across different video lengths and resolutions, highlighting its practicality in real-world scenarios.

Yu Huang, Junhao Chen, Shuliang Liu, Hanqian Li, Jungang Li, Qi Zheng, Aiwei Liu, Yi R. Fung, Xuming Hu • 2025

Related benchmarks

Task | Dataset | Result | Rank
Video Watermarking | VBench ModelScope (MS) (test) | Bit Accuracy: 100 | 4
Watermark Robustness | ModelScope | Robustness (Gauss Attack I): 0.937 | 4
Video Watermarking | Stable Video Diffusion Synthesized Videos SVD-XT | Bit Accuracy: 95.8 | 4
Watermark Robustness | SVD-XT | Robustness (Gauss Noise) - Cat I: 0.937 | 4
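
This page does not define the Bit Accuracy metric reported in the benchmarks. A common reading in watermarking work is the percentage of embedded message bits that the extractor recovers correctly. The sketch below illustrates that reading only; the function name `bit_accuracy` and the example bit strings are hypothetical and do not come from the paper or its code.

```python
from typing import Sequence


def bit_accuracy(embedded: Sequence[int], extracted: Sequence[int]) -> float:
    """Percentage of positions where the extracted bit equals the embedded bit.

    Assumption: both arguments are equal-length sequences of 0/1 values.
    """
    if len(embedded) != len(extracted):
        raise ValueError("bit strings must have the same length")
    if not embedded:
        raise ValueError("bit strings must be non-empty")
    matches = sum(1 for a, b in zip(embedded, extracted) if a == b)
    return 100.0 * matches / len(embedded)


# Hypothetical 8-bit message and a recovered copy with one flipped bit.
message = [1, 0, 1, 1, 0, 0, 1, 0]
recovered = [1, 0, 1, 0, 0, 0, 1, 0]
score = bit_accuracy(message, recovered)
print(score)  # 7 of 8 bits match
```

Under this reading, a score of 100 means every message bit was recovered, and a score near 50 is what random guessing would produce for a uniformly random binary message.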