Hidden in the Noise: Two-Stage Robust Watermarking for Images

About

As the quality of image generators continues to improve, deepfakes have become a topic of considerable societal debate. Image watermarking allows responsible model owners to detect and label their AI-generated content, which can mitigate the harm. Yet current state-of-the-art methods in image watermarking remain vulnerable to forgery and removal attacks. This vulnerability occurs in part because watermarks distort the distribution of generated images, unintentionally revealing information about the watermarking techniques. In this work, we first demonstrate a distortion-free watermarking method for images, based on a diffusion model's initial noise. However, detecting the watermark requires comparing the initial noise reconstructed for an image to all previously used initial noises. To reduce this detection cost, we propose a two-stage watermarking framework for efficient detection. During generation, we augment the initial noise with generated Fourier patterns to embed information about the group of initial noises we used. For detection, we (i) retrieve the relevant group of noises, and (ii) search within the given group for an initial noise that might match our image. This watermarking approach achieves state-of-the-art robustness to forgery and removal against a large battery of attacks.
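
The two detection stages described above can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a small 16x16 array in place of a diffusion latent, simulates the reconstructed initial noise by adding Gaussian perturbation instead of inverting a diffusion model, and uses a hypothetical block of Fourier coefficients (`MASK`) as the group pattern. The names `embed_group_key`, `build_bank`, and `detect`, as well as the 0.5 similarity threshold, are illustrative assumptions.

```python
import numpy as np

SIZE = 16  # toy latent side length (assumption, not from the paper)
# Hypothetical block of half-spectrum coefficients that carries the group key.
MASK = (slice(1, 5), slice(1, 5))


def embed_group_key(noise, key):
    """Overwrite a small block of Fourier coefficients with the group's key."""
    spectrum = np.fft.rfft2(noise)
    spectrum[MASK] = key
    # irfft2 returns a real array whose rfft2 carries the key in MASK.
    return np.fft.irfft2(spectrum, s=noise.shape)


def build_bank(num_groups, per_group, rng):
    """Create one random complex key per group and the noises that carry it."""
    keys = [
        8.0 * (rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4)))
        for _ in range(num_groups)
    ]
    bank = [
        [embed_group_key(rng.standard_normal((SIZE, SIZE)), key)
         for _ in range(per_group)]
        for key in keys
    ]
    return keys, bank


def cosine(a, b):
    """Cosine similarity between two arrays, flattened."""
    return float(np.sum(a * b) / (np.linalg.norm(a) * np.linalg.norm(b)))


def detect(reconstructed, keys, bank, threshold=0.5):
    """Return (group, index) of the matching initial noise, or None."""
    # Stage (i): retrieve the group whose key is closest in the Fourier block.
    block = np.fft.rfft2(reconstructed)[MASK]
    group = int(np.argmin([np.abs(block - key).sum() for key in keys]))
    # Stage (ii): search only within that group for a matching initial noise.
    sims = [cosine(reconstructed, noise) for noise in bank[group]]
    best = int(np.argmax(sims))
    if sims[best] < threshold:
        return None  # no stored noise is similar enough: treat as unwatermarked
    return group, best


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    keys, bank = build_bank(num_groups=4, per_group=5, rng=rng)
    # Simulate an imperfect reconstruction of one stored initial noise.
    recon = bank[2][3] + 0.3 * rng.standard_normal((SIZE, SIZE))
    print(detect(recon, keys, bank))
    print(detect(rng.standard_normal((SIZE, SIZE)), keys, bank))
```

In this sketch, stage (ii) compares the reconstructed noise against only the noises of one group rather than the whole bank, which is the source of the efficiency gain the abstract describes.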

Kasra Arabi, Benjamin Feuer, R. Teal Witter, Chinmay Hegde, Niv Cohen • 2024

Related benchmarks

Task	Dataset	Result
Watermark Verification Runtime	Gustavo prompts 2022	VAE Encode Time (s): 0.037	16
Watermark Attack	Stable-Diffusion-Prompts	Clean Scenario Performance: 0.00e+0	9
Image Watermarking	Stable Diffusion 2.0	TPR (Clean): 100	8
Watermark Resistance against Generative Forgery Attacks	Stable Diffusion Latent Forgery Attack (LFA) V2	ASR: 100	5
Watermark Forgery Detection	Cat Attack Generated Images	ROC-AUC: 0.00e+0	5
Watermark Resistance against Generative Forgery Attacks	Stable Diffusion subjected to Regeneration with the Private Model (RPM) V2	ASR: 100	5
Watermark Resistance against Generative Forgery Attacks	Stable Diffusion Coherence-Preserving Semantic Injection (CSI) V2	ASR: 100	5
Watermark Verification Runtime	VBench 2.0	VAE Encode Time: 6.463	4
Watermark Forgery Detection	Private Model-Based Forgery Attack	ROC-AUC: 0.00e+0	4

