Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models

About

The widespread adoption of large language models (LLMs) necessitates reliable methods to detect LLM-generated text. We introduce SimMark, a robust sentence-level watermarking algorithm that makes LLMs' outputs traceable without requiring access to model internals, making it compatible with both open and API-based LLMs. By leveraging the similarity of semantic sentence embeddings combined with rejection sampling to embed detectable statistical patterns imperceptible to humans, and employing a soft counting mechanism, SimMark achieves robustness against paraphrasing attacks. Experimental results demonstrate that SimMark sets a new benchmark for robust watermarking of LLM-generated content, surpassing prior sentence-level watermarking techniques in robustness, sampling efficiency, and applicability across diverse domains, all while maintaining the text quality and fluency.

Amirhossein Dabiriaghdam, Lele Wang• 2025

Related benchmarks

TaskDatasetResultRank
Watermark DetectionC4
Detection Accuracy (No Attack)77.6
24
Watermarking DetectionBookSum (test)
Detection Rate (No Attack)88.2
24
Watermarking Token EfficiencyBookSum (test)
Avg Tokens per Sentence186.7
5
Showing 3 of 3 rows

Other info

Follow for update