Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BiMark: Unbiased Multilayer Watermarking for Large Language Models

About

Recent advances in Large Language Models (LLMs) have raised urgent concerns about LLM-generated text authenticity, prompting regulatory demands for reliable identification mechanisms. Although watermarking offers a promising solution, existing approaches struggle to simultaneously achieve three critical requirements: text quality preservation, model-agnostic detection, and message embedding capacity, which are crucial for practical implementation. To achieve these goals, the key challenge lies in balancing the trade-off between text quality preservation and message embedding capacity. To address this challenge, we propose BiMark, a novel watermarking framework that achieves these requirements through three key innovations: (1) a bit-flip unbiased reweighting mechanism enabling model-agnostic detection, (2) a multilayer architecture enhancing detectability without compromising generation quality, and (3) an information encoding approach supporting multi-bit watermarking. Through theoretical analysis and extensive experiments, we validate that, compared to state-of-the-art multi-bit watermarking methods, BiMark achieves up to 30% higher extraction rates for short texts while maintaining text quality indicated by lower perplexity, and performs comparably to non-watermarked text on downstream tasks such as summarization and translation.

Xiaoyan Feng, He Zhang, Yanjun Zhang, Leo Yu Zhang, Shirui Pan• 2025

Related benchmarks

TaskDatasetResultRank
Fake News DetectionFAKE NEWS
Accuracy97.83
66
Watermark Detectionbook_report
Accuracy99.06
48
Watermark Detectionmmw story
Accuracy99.61
48
Watermark Detectionfake_news
Accuracy98.81
48
Watermark Detectionlongform_qa
Accuracy97.09
48
Watermark Detectionfinance_qa
Accuracy97.13
48
Watermark Detectiondolly_cw
Accuracy94.38
48
Detection Accuracydolly_cw
Accuracy96
24
Detection Accuracymmw story
Accuracy98.82
24
Watermark DetectionC4 subset
Accuracy99.62
24
Showing 10 of 33 rows

Other info

Follow for update