Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Majority Bit-Aware Watermarking For Large Language Models

About

The growing deployment of Large Language Models (LLMs) has raised concerns about their misuse in generating harmful or deceptive content. To address this issue, watermarking methods have been proposed to embed identifiable multi-bit messages into generated text for misuse tracing. However, existing methods often suffer from a fundamental trade-off between text quality and decoding accuracy. In particular, they have to restrict the size of the preferred token set (i.e., green list) during encoding to maintain a detectable watermark signal for decoding, which inevitably degrades generation quality. To improve this trade-off, we propose a novel message encoding paradigm called \textit{majority bit-aware encoding}, which relaxes the watermark signal strength from the green list size. This strategy allows for a strong watermark signal to be preserved in generated texts even when using a large green list. We introduce two instantiations of this paradigm: MajorMark and MajorMark$^{+}$, where the latter is specifically optimized for long messages. Extensive experiments on state-of-the-art LLMs demonstrate that our methods achieve higher decoding accuracy and superior text quality compared to prior baselines.

Jiahao Xu, Rui Hu, Olivera Kotevska, Zikai Zhang• 2025

Related benchmarks

TaskDatasetResultRank
Text WatermarkingC4 Message length b=8 v1 (test)
BA100
18
Story GenerationWritingPrompts
Brier Accuracy (BA)98.44
16
Text SummarizationCNN/DailyMail
BA95.78
16
Semantic and Lexical Fidelity EvaluationWatermarked Text Generation (watermarked vs unwatermarked outputs)
BERTScore0.8449
12
Text WatermarkingC4 Message length b=64 v1 (test)
BA99.69
12
WatermarkingEssays b=32
Bit Accuracy100
12
WatermarkingEssays b=64
BA99.69
12
WatermarkingOpenGen b=32
Bit Accuracy (BA)100
12
WatermarkingOpenGen b=64
BA99.06
12
Text WatermarkingC4 Message length b=32 v1 (test)
Bit Accuracy100
12
Showing 10 of 15 rows

Other info

Follow for update