Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Watermarking Autoregressive Image Generation

About

Watermarking the outputs of generative models has emerged as a promising approach for tracking their provenance. Despite significant interest in autoregressive image generation models and their potential for misuse, no prior work has attempted to watermark their outputs at the token level. In this work, we present the first such approach by adapting language model watermarking techniques to this setting. We identify a key challenge: the lack of reverse cycle-consistency (RCC), wherein re-tokenizing generated image tokens significantly alters the token sequence, effectively erasing the watermark. To address this and to make our method robust to common image transformations, neural compression, and removal attacks, we introduce (i) a custom tokenizer-detokenizer finetuning procedure that improves RCC, and (ii) a complementary watermark synchronization layer. As our experiments demonstrate, our approach enables reliable and robust watermark detection with theoretically grounded p-values. Code and models are available at https://github.com/facebookresearch/wmar.

Nikola Jovanovi\'c, Ismail Labiad, Tom\'a\v{s} Sou\v{c}ek, Martin Vechev, Pierre Fernandez• 2025

Related benchmarks

TaskDatasetResultRank
Image GenerationImageNet (val)--
247
Audio Quality EvaluationMoshi conversational audio prompts
VGGish Score0.407
13
Audio Quality EvaluationMoshi LibriSpeech prompts
VGGish Score2.195
13
Watermark DetectionImageNet RAR-XL 256x256
FID4.23
10
Zero-bit Watermark DetectionImageNet (val)
TPR@FPR=1%99.6
9
Audio Generation QualityMusicCaps MusicGen 32kHz (val)
FAD (VGGish)1.193
4
Zero-bit Watermark DetectionLlamaGen
TPR (JPEG)93.1
3
Showing 7 of 7 rows

Other info

Follow for update