REEF: Representation Encoding Fingerprints for Large Language Models
About
Protecting the intellectual property of open-source Large Language Models (LLMs) is very important, because training LLMs costs extensive computational resources and data. Therefore, model owners and third parties need to identify whether a suspect model is a subsequent development of the victim model. To this end, we propose a training-free REEF to identify the relationship between the suspect and victim models from the perspective of LLMs' feature representations. Specifically, REEF computes and compares the centered kernel alignment similarity between the representations of a suspect model and a victim model on the same samples. This training-free REEF does not impair the model's general capabilities and is robust to sequential fine-tuning, pruning, model merging, and permutations. In this way, REEF provides a simple and effective way for third parties and models' owners to protect LLMs' intellectual property together. The code is available at https://github.com/tmylla/REEF.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Model Fingerprinting Robustness Evaluation | Pruning Robustness Evaluation Dataset | Similarity Score0.9895 | 127 | |
| Model Fingerprinting Robustness | Structured Pruning Suspects Sheared-Llama | Similarity Score99.91 | 42 | |
| LLM fingerprinting | LLM Lineage Verification Dataset LLaMA and Qwen-style families | AUC0.996 | 35 | |
| Model Fingerprinting | Continual Pretrain Positive Samples | Absolute Z-score1.93 | 30 | |
| Model Fingerprinting | Pruning Positive Samples | Absolute Z-score1.8 | 30 | |
| Model Fingerprinting | SFT Positive Samples | Absolute Z-score1.95 | 30 | |
| Model Fingerprinting | Upcycling Positive Samples | Absolute Z-score1.49 | 30 | |
| Model Fingerprinting | Multi Modal Positive Samples | Absolute Z-score0.84 | 30 | |
| Model Fingerprinting | RL Positive Samples | Absolute Z-score1.96 | 30 | |
| Fingerprint Similarity | LLaMA2-7B | Similarity Score0.7458 | 24 |