A Fingerprint for Large Language Models
About
Recent advances confirm that large language models (LLMs) can achieve state-of-the-art performance across various tasks. However, because training LLMs from scratch is resource-intensive, protecting their intellectual property against infringement is both urgent and crucial. This motivates us to propose a novel black-box fingerprinting technique for LLMs. We first demonstrate that the outputs of an LLM span a unique vector space associated with that model. We then model fingerprint authentication as the task of evaluating the similarity between the space of the victim model and the space of the suspect model. To tackle this problem, we introduce two solutions: the first determines whether suspect outputs lie within the victim's subspace, enabling fast infringement detection; the second reconstructs a joint subspace to detect models modified via parameter-efficient fine-tuning (PEFT). Experiments indicate that the proposed method achieves superior performance in fingerprint verification and robustness against PEFT attacks. This work reveals inherent characteristics of LLMs and provides a promising solution for protecting them that is efficient, general, and practical.
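The first solution, checking whether suspect outputs lie within the victim's subspace, can be illustrated with a toy sketch. This is not the authors' implementation: it assumes, purely for illustration, that each "model" maps a small random hidden state to an output vector through a fixed linear layer, so its outputs span a low-dimensional subspace. The names `victim_weights`, `unrelated_weights`, `toy_output`, and `residual_ratio`, along with the dimensions and the use of Gram-Schmidt, are hypothetical choices. Only the Python standard library is used.

```python
import math
import random

HIDDEN_DIM = 4   # toy hidden size (assumption, not from the paper)
VOCAB_DIM = 12   # toy output dimension (assumption, not from the paper)


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def remove_components(vector, basis):
    """Subtract from `vector` its projection onto each orthonormal basis vector."""
    w = list(vector)
    for b in basis:
        coeff = dot(w, b)
        w = [wi - coeff * bi for wi, bi in zip(w, b)]
    return w


def orthonormal_basis(vectors, rel_tol=1e-9):
    """Gram-Schmidt: build an orthonormal basis for the span of `vectors`."""
    basis = []
    for v in vectors:
        w = remove_components(v, basis)
        norm = math.sqrt(dot(w, w))
        # Keep only directions that are not (numerically) already covered.
        if norm > rel_tol * math.sqrt(dot(v, v)):
            basis.append([wi / norm for wi in w])
    return basis


def residual_ratio(vector, basis):
    """Fraction of the vector's norm lying outside span(basis); near 0 means inside."""
    w = remove_components(vector, basis)
    return math.sqrt(dot(w, w)) / math.sqrt(dot(vector, vector))


def random_matrix(rng, rows, cols):
    return [[rng.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]


def toy_output(weights, rng):
    """Stand-in for one model output: a fixed linear layer applied to a random hidden state."""
    hidden = [rng.gauss(0, 1) for _ in range(HIDDEN_DIM)]
    return [dot(row, hidden) for row in weights]


rng = random.Random(0)
victim_weights = random_matrix(rng, VOCAB_DIM, HIDDEN_DIM)
unrelated_weights = random_matrix(rng, VOCAB_DIM, HIDDEN_DIM)

# Estimate the victim's output subspace from a handful of queried outputs.
victim_outputs = [toy_output(victim_weights, rng) for _ in range(8)]
basis = orthonormal_basis(victim_outputs)

# A suspect that reuses the victim's output layer stays inside the subspace;
# an unrelated model generally does not.
derived_score = residual_ratio(toy_output(victim_weights, rng), basis)
unrelated_score = residual_ratio(toy_output(unrelated_weights, rng), basis)

print(f"basis size: {len(basis)}")
print(f"residual (suspect sharing victim's layer): {derived_score:.2e}")
print(f"residual (unrelated model): {unrelated_score:.2e}")
```

In this toy setting a residual close to zero suggests that the suspect's outputs fall inside the victim's subspace, while a clearly nonzero residual suggests a different model. Any decision threshold would need to be calibrated on real model outputs, and the sketch does not cover the joint-subspace reconstruction used for the PEFT case.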
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Model Fingerprinting Robustness Evaluation | Pruning Robustness Evaluation Dataset | Similarity Score | 0.9999 | 127 |
| Model Fingerprinting Robustness | Structured Pruning Suspects Sheared-Llama | Similarity Score | 99.99 | 42 |
| Fingerprint Similarity | LLaMA2-7B | Similarity Score | 0.9891 | 24 |
| Model Fingerprinting Robustness | Unstructured Pruning Suspects Llama-2-7b | Similarity Score | 99.99 | 21 |
| Model Fingerprinting | Qwen2.5-derived suspects v0.1 | Similarity Score | 0.5275 | 12 |
| Model Fingerprinting Robustness | Evollm-jp Weight Merging Wizardmath 7b 1.1 | Similarity Score | 0.9999 | 7 |
| Model Fingerprinting Robustness | Evollm-jp-7b Weight Merging Abel-7b-002 | Similarity Score | 99.99 | 7 |
| Model Fingerprinting Robustness | Fusellm-7b Distribution Merging Llama-2-7b | Similarity Score | 99.99 | 7 |
| Model Fingerprinting Robustness | Llama-2-Finance-7B | Similarity Score | 99.99 | 7 |
| Model Fingerprinting Robustness | Vicuna 1.5-7B | Similarity Score | 99.99 | 7 |