AWM: Accurate Weight-Matrix Fingerprint for Large Language Models

About

Protecting the intellectual property of large language models (LLMs) is crucial, given the substantial resources required for their training. Consequently, there is an urgent need for both model owners and third parties to determine whether a suspect LLM was trained from scratch or derived from an existing base model. However, the intensive post-training processes that models typically undergo (supervised fine-tuning, extensive continued pretraining, reinforcement learning, multi-modal extension, pruning, and upcycling) pose significant challenges to reliable identification. In this work, we propose a training-free fingerprinting method based on weight matrices. We leverage the Linear Assignment Problem (LAP) and an unbiased Centered Kernel Alignment (CKA) similarity to neutralize the effects of parameter manipulations, yielding a highly robust and high-fidelity similarity metric. On a comprehensive testbed of 60 positive and 90 negative model pairs, our method is robust against all six aforementioned post-training categories while exhibiting a near-zero risk of false positives. By achieving perfect scores on all classification metrics, our approach establishes a strong basis for reliable model lineage verification. Moreover, the entire computation completes within 30 s on an NVIDIA 3090 GPU. The code is available at https://github.com/LUMIA-Group/AWM.

Boyi Zeng, Lin Chen, Ziwei He, Xinbing Wang, Zhouhan Lin (2025)
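
The two ingredients named in the abstract, a linear assignment step and an unbiased CKA similarity, can be illustrated on toy data. The sketch below is not the authors' implementation (see their repository for that). It assumes a hypothetical setup in which a "suspect" matrix is a row-permuted, rescaled, slightly noisy copy of a "base" matrix. It recovers the permutation with SciPy's `linear_sum_assignment` on cosine similarities and then scores the aligned pair with a linear-kernel CKA built on the standard unbiased HSIC estimator. The function names, matrix sizes, and noise level are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def unbiased_hsic(K, L):
    """Unbiased HSIC estimator for two n x n Gram matrices (requires n > 3)."""
    n = K.shape[0]
    K = K.copy()
    L = L.copy()
    np.fill_diagonal(K, 0.0)  # the unbiased estimator drops diagonal terms
    np.fill_diagonal(L, 0.0)
    ones = np.ones(n)
    term1 = np.trace(K @ L)
    term2 = (ones @ K @ ones) * (ones @ L @ ones) / ((n - 1) * (n - 2))
    term3 = 2.0 * (ones @ K @ L @ ones) / (n - 2)
    return (term1 + term2 - term3) / (n * (n - 3))


def unbiased_linear_cka(X, Y):
    """Linear-kernel CKA between row-aligned matrices X (n x d1) and Y (n x d2)."""
    K, L = X @ X.T, Y @ Y.T
    return unbiased_hsic(K, L) / np.sqrt(unbiased_hsic(K, K) * unbiased_hsic(L, L))


def align_rows(base, suspect):
    """Match each base row to a suspect row by solving a linear assignment
    problem that maximizes total cosine similarity."""
    b = base / np.linalg.norm(base, axis=1, keepdims=True)
    s = suspect / np.linalg.norm(suspect, axis=1, keepdims=True)
    _, col_ind = linear_sum_assignment(b @ s.T, maximize=True)
    return col_ind  # suspect[col_ind[i]] is the match for base[i]


# Toy data (assumption): the suspect is a permuted, rescaled, noisy copy of the base.
rng = np.random.default_rng(0)
n, d = 128, 16
base = rng.standard_normal((n, d))
perm = rng.permutation(n)
suspect = 3.0 * base[perm] + 0.01 * rng.standard_normal((n, d))

cka_unaligned = unbiased_linear_cka(base, suspect)
col_ind = align_rows(base, suspect)
aligned = suspect[col_ind]
cka_aligned = unbiased_linear_cka(base, aligned)

# Linear CKA is also unchanged by an orthogonal transform of the columns.
Q, _ = np.linalg.qr(rng.standard_normal((d, d)))
cka_rotated = unbiased_linear_cka(base, aligned @ Q)

print("unaligned:", cka_unaligned)
print("aligned:  ", cka_aligned)
print("rotated:  ", cka_rotated)
```

In this toy setting the score should be close to zero before alignment and close to one afterwards, which shows why undoing a permutation first matters for a similarity that compares matrices row by row. The scaling factor and the orthogonal column transform do not affect the linear-kernel CKA, which is the kind of invariance a weight-based fingerprint relies on.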

Related benchmarks

Task                  Dataset                              Metric            Result  Rank
Model Fingerprinting  SFT Positive Samples                 Absolute Z-score  355.2   30
Model Fingerprinting  Continual Pretrain Positive Samples  Absolute Z-score  268.7   30
Model Fingerprinting  Upcycling Positive Samples           Absolute Z-score  332.2   30
Model Fingerprinting  Multi Modal Positive Samples         Absolute Z-score  355.1   30
Model Fingerprinting  RL Positive Samples                  Absolute Z-score  355.3   30
Model Fingerprinting  Pruning Positive Samples             Absolute Z-score  344.1   30
Model Fingerprinting  Model Fingerprinting Dataset (SFT)   |Z| Score         353.8   5
Model Fingerprinting  Model Fingerprinting Dataset (CPT)   |Z|               219.2   5
Model Fingerprinting  Model Fingerprinting Dataset (UP)    |Z|               287.6   5
Model Fingerprinting  Model Fingerprinting Dataset MM      |Z|               334.6   5
Showing 10 of 13 rows
