AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing

About

Existing sentence-level watermarking methods enhance robustness to paraphrasing by anchoring watermarks in sentence semantics. However, their prefix-based designs remain vulnerable to structural perturbations, such as sentence splitting and merging, which commonly arise under strong paraphrasers like DIPPER and GPT-3.5. To mitigate this issue, we propose AliMark, a framework that reformulates sentence-level watermarking as a bit sequence encoding and alignment problem between a potentially watermarked text and a secret bit sequence. Notably, our approach adopts a two-stage detection strategy: we generate multiple restructured text variants and adaptively align their extracted bit sequences with the secret bit sequence to minimize alignment cost. This multi-candidate alignment design naturally improves robustness to sentence merges and splits. Extensive experiments demonstrate that AliMark substantially outperforms state-of-the-art baselines under diverse paraphrasing attacks.

Yuexin Li, Wenjie Qu, Linyu Wu, Yulin Chen, Yufei He, Tri Cao, Bryan Hooi, Jiaheng Zhang• 2026

Related benchmarks

Task	Dataset	Result
Watermark Detection	BookSum	TP @ FP=1%100	154
Watermark Detection	C4	TPR @ FPR=1%0.992	95
Watermarking	Natural Questions (NQ) (test)	AUROC100	45
Sentence-Level Watermarking	C4	AUROC99.9	40

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord