WorldCup Sampling for Multi-bit LLM Watermarking

About

As large language models (LLMs) generate increasingly human-like text, watermarking has emerged as a promising solution for reliable attribution beyond mere detection. While multi-bit watermarking enables richer provenance encoding, existing approaches typically extend zero-bit watermarking schemes by introducing static logit perturbations and counting-based decoding strategies, which can degrade text quality and compromise decoding robustness as the payload increases. In this paper, we propose WorldCup, a multi-bit watermarking framework for LLMs that models the sampling process as a structured communication channel and embeds message bits through a hierarchical competition mechanism guided by complementary signals. Moreover, WorldCup incorporates entropy-aware modulation to preserve generation quality and enables robust message recovery via confidence-aware decoding that accounts for token-level reliability. Comprehensive experiments demonstrate that WorldCup achieves a strong balance across message capacity, detectability, robustness, text quality, and decoding efficiency, consistently outperforming prior baselines. We believe that this work establishes a scalable and principled foundation for future research on multi-bit watermarking in LLMs.

Yidan Wang, Yubing Ren, Yanan Cao, Li Guo• 2026

Related benchmarks

Task	Dataset	Result
Multi-bit LLM Watermarking	C4 GEMMA2-9B-BASE Max 256 Tokens	AUC1	20
Multi-bit LLM Watermarking	Gemma2-9B-Base Max 256 Tokens	AUC1	20
Multi-bit LLM Watermarking	C4 LLaMA3-8B-BASE Max 128 Tokens	AUC1	20
Multi-bit LLM Watermarking	C4 LLaMA3-8B-BASE Max 256 Tokens	AUC100	20
Multi-bit LLM Watermarking	C4 GEMMA2-9B-BASE Max 128 Tokens	AUC100	20
Multi-bit LLM Watermarking	LLaMA3-8B-Base Max 128 Tokens	AUC1	20
Multi-bit LLM Watermarking	LLaMA3-8B-Base Max 256 Tokens	AUC1	20
Multi-bit LLM Watermarking	Gemma2-9B-Base Max 128 Tokens	AUC0.998	20
Long-form QA	Long-form QA Short Q, Long A (test)	GPT4 Score6.182	15
Machine Translation	Machine Translation Short Q, Short A (test)	BLEU0.417	15

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord