Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing

About

Existing sentence-level watermarking methods enhance robustness to paraphrasing by anchoring watermarks in sentence semantics. However, their prefix-based designs remain vulnerable to structural perturbations, such as sentence splitting and merging, which commonly arise under strong paraphrasers like DIPPER and GPT-3.5. To mitigate this issue, we propose AliMark, a framework that reformulates sentence-level watermarking as a bit sequence encoding and alignment problem between a potentially watermarked text and a secret bit sequence. Notably, our approach adopts a two-stage detection strategy: we generate multiple restructured text variants and adaptively align their extracted bit sequences with the secret bit sequence to minimize alignment cost. This multi-candidate alignment design naturally improves robustness to sentence merges and splits. Extensive experiments demonstrate that AliMark substantially outperforms state-of-the-art baselines under diverse paraphrasing attacks.

Yuexin Li, Wenjie Qu, Linyu Wu, Yulin Chen, Yufei He, Tri Cao, Bryan Hooi, Jiaheng Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Watermark DetectionBookSum
TP @ FP=1%100
154
Watermark DetectionC4
TPR @ FPR=1%0.992
95
WatermarkingNatural Questions (NQ) (test)
AUROC100
45
Sentence-Level WatermarkingC4
AUROC99.9
40
Showing 4 of 4 rows

Other info

Follow for update