REEF: Representation Encoding Fingerprints for Large Language Models

About

Protecting the intellectual property of open-source Large Language Models (LLMs) is very important, because training LLMs costs extensive computational resources and data. Therefore, model owners and third parties need to identify whether a suspect model is a subsequent development of the victim model. To this end, we propose a training-free REEF to identify the relationship between the suspect and victim models from the perspective of LLMs' feature representations. Specifically, REEF computes and compares the centered kernel alignment similarity between the representations of a suspect model and a victim model on the same samples. This training-free REEF does not impair the model's general capabilities and is robust to sequential fine-tuning, pruning, model merging, and permutations. In this way, REEF provides a simple and effective way for third parties and models' owners to protect LLMs' intellectual property together. The code is available at https://github.com/tmylla/REEF.

Jie Zhang, Dongrui Liu, Chen Qian, Linfeng Zhang, Yong Liu, Yu Qiao, Jing Shao• 2024

Related benchmarks

Task	Dataset	Result
Model Fingerprinting Robustness Evaluation	Pruning Robustness Evaluation Dataset	Similarity Score0.9895	127
Model Fingerprinting Robustness	Structured Pruning Suspects Sheared-Llama	Similarity Score99.91	42
LLM fingerprinting	LLM Lineage Verification Dataset LLaMA and Qwen-style families	AUC0.996	35
Model Fingerprinting	Continual Pretrain Positive Samples	Absolute Z-score1.93	30
Model Fingerprinting	Pruning Positive Samples	Absolute Z-score1.8	30
Model Fingerprinting	SFT Positive Samples	Absolute Z-score1.95	30
Model Fingerprinting	Upcycling Positive Samples	Absolute Z-score1.49	30
Model Fingerprinting	Multi Modal Positive Samples	Absolute Z-score0.84	30
Model Fingerprinting	RL Positive Samples	Absolute Z-score1.96	30
Fingerprint Similarity	LLaMA2-7B	Similarity Score0.7458	24

Showing 10 of 62 rows

Other info

Follow for update

@wizwand_team Discord