Scalable Fingerprinting of Large Language Models

About

Model fingerprinting has emerged as a powerful tool for model owners to identify their shared model given API access. However, to lower false discovery rate, fight fingerprint leakage, and defend against coalitions of model users attempting to bypass detection, we argue that {\em scalability} is critical, i.e., scaling up the number of fingerprints one can embed into a model. Hence, we pose scalability as a crucial requirement for fingerprinting schemes. We experiment with fingerprint design at a scale significantly larger than previously considered, and introduce a new method, dubbed Perinucleus sampling, to generate scalable, persistent, and harmless fingerprints. We demonstrate that this scheme can add 24,576 fingerprints to a Llama-3.1-8B model -- two orders of magnitude more than existing schemes -- without degrading the model's utility. Our inserted fingerprints persist even after supervised fine-tuning on standard post-training data. We further address security risks for fingerprinting, and theoretically and empirically show how a scalable fingerprinting scheme like ours can mitigate these risks. Our code is available at https://github.com/SewoongLab/scalable-fingerprinting-of-llms

Anshul Nasery, Jonathan Hayase, Creston Brooks, Peiyao Sheng, Himanshu Tyagi, Pramod Viswanath, Sewoong Oh• 2025

Related benchmarks

Task	Dataset	Result
Fingerprint Robustness Evaluation	Prominent Deployment Scenarios Robustness Evaluation 1.0	Fingerprint Success Rate100	24
Fingerprint Detection	WildChat Fr	FSR0.00e+0	18
Fingerprint Detection	Active Output Modification	FSR80	18
Fingerprint Detection	English System Prompts	FSR100	9
Fingerprint Robustness Evaluation	System Prompts Weather	FSR100	9
Fingerprint Robustness Evaluation	System Prompts Pirate	FSR80	9
Fingerprint Robustness Evaluation	Active Output Translation	FSR0.4	9
Fingerprint Robustness Evaluation	System Prompts Robot	FSR1	9
Fingerprint Robustness Evaluation	System Prompts OAI	FSR100	9
Fingerprint Robustness Evaluation	Active Input Translation	FSR0.00e+0	9

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord