WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks

About

Embeddings-as-a-Service (EaaS) is a service offered by large language model (LLM) developers to supply embeddings generated by LLMs. Previous research suggests that EaaS is prone to imitation attacks -- attacks that clone the underlying EaaS model by training another model on the queried embeddings. As a result, EaaS watermarks are introduced to protect the intellectual property of EaaS providers. In this paper, we first show that existing EaaS watermarks can be removed by paraphrasing when attackers clone the model. Subsequently, we propose a novel watermarking technique that involves linearly transforming the embeddings, and show that it is empirically and theoretically robust against paraphrasing.

Anudeex Shetty, Qiongkai Xu, Jey Han Lau • 2024
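
The abstract describes the watermark only as a linear transformation of the embeddings. A rough sketch of that general idea follows, and it is not the authors' construction. It assumes a secret, dense, square Gaussian matrix as the transformation and assumes verification by undoing the transformation with a pseudo-inverse and comparing directions. The names `make_transform`, `apply_watermark`, `recover`, and `cosine` are hypothetical and chosen for illustration.

```python
import numpy as np


def make_transform(dim, seed):
    """Secret square matrix (assumption: dense Gaussian, keyed by a seed)."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal((dim, dim))


def apply_watermark(embeddings, transform):
    """Linearly transform each row embedding, then rescale to unit length."""
    out = embeddings @ transform.T
    return out / np.linalg.norm(out, axis=-1, keepdims=True)


def recover(watermarked, transform):
    """Undo the transformation with the pseudo-inverse and renormalise."""
    out = watermarked @ np.linalg.pinv(transform).T
    return out / np.linalg.norm(out, axis=-1, keepdims=True)


def cosine(a, b):
    """Row-wise cosine similarity between two batches of vectors."""
    num = np.sum(a * b, axis=-1)
    den = np.linalg.norm(a, axis=-1) * np.linalg.norm(b, axis=-1)
    return num / den


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    original = rng.standard_normal((100, 64))  # stand-in for provider embeddings
    secret = make_transform(64, seed=1)
    served = apply_watermark(original, secret)

    # With the right matrix the original directions come back.
    print("correct key:", cosine(recover(served, secret), original).mean())
    # With a different matrix the recovered directions are unrelated.
    wrong = make_transform(64, seed=2)
    print("wrong key:  ", np.abs(cosine(recover(served, wrong), original)).mean())
```

Under these assumptions every served embedding carries the transformation, so a check of this kind does not depend on particular trigger words surviving a paraphrase. How the paper actually builds the matrix and scores a suspected copy is not stated on this page.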

Related benchmarks

Task	Dataset	Result
Text Classification	AG-News	Accuracy: 93.4	248
Text Classification	SST2	Accuracy: 93.35	71
Text Classification	MIND	Accuracy: 76.91	48
Text Classification	AGNews	Accuracy: 93.32	38
Text Classification	SST-2	Accuracy: 93.27	24
Text Classification	Enron Spam	Accuracy (ACC): 95.38	21
