Robust Invisible Video Watermarking with Attention
About
The goal of video watermarking is to embed a message within a video file in a way such that it minimally impacts the viewing experience but can be recovered even if the video is redistributed and modified, allowing media producers to assert ownership over their content. This paper presents RivaGAN, a novel architecture for robust video watermarking which features a custom attention-based mechanism for embedding arbitrary data as well as two independent adversarial networks which critique the video quality and optimize for robustness. Using this technique, we are able to achieve state-of-the-art results in deep learning-based video watermarking and produce watermarked videos which have minimal visual distortion and are robust against common video processing operations.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Watermark Extraction | COCO | Bit Accuracy84 | 98 | |
| Image Watermarking | ImageNet | Bit Accuracy (Overall)78 | 98 | |
| Watermark Detection | ImageNet 2014 (val) | Detection Rate (Level 1)92 | 66 | |
| Watermark Generation | COCO | PSNR40.57 | 21 | |
| Image Watermarking | MS-COCO | PSNR40.57 | 21 | |
| Image Watermarking | Stable Diffusion V2.1 | -- | 20 | |
| Image Watermarking | DiffDB | PSNR40.6 | 17 | |
| Generative Image Watermarking | Stable Diffusion Server Scenario v1-5 (test) | CLIP Score0.3628 | 10 | |
| Image Watermarking | WikiArt | PSNR40.44 | 8 | |
| Video Watermarking | Video Watermarking (test) | FPS2.4554 | 8 |