CREDIT: Certified Ownership Verification of Deep Neural Networks Against Model Extraction Attacks

About

Machine Learning as a Service (MLaaS) has emerged as a widely adopted paradigm for providing access to deep neural network (DNN) models, enabling users to conveniently leverage these models through standardized APIs. However, such services are highly vulnerable to Model Extraction Attacks (MEAs), where an adversary repeatedly queries a target model to collect input-output pairs and uses them to train a surrogate model that closely replicates its functionality. While numerous defense strategies have been proposed, verifying the ownership of a suspicious model with strict theoretical guarantees remains a challenging task. To address this gap, we introduce CREDIT, a certified ownership verification against MEAs. Specifically, we employ mutual information to quantify the similarity between DNN models, propose a practical verification threshold, and provide rigorous theoretical guarantees for ownership verification based on this threshold. We extensively evaluate our approach on several mainstream datasets across different domains and tasks, achieving state-of-the-art performance. Our implementation is publicly available at: https://github.com/LabRAI/CREDIT.

Bolin Shen, Zhan Cheng, Neil Zhenqiang Gong, Fan Yao, Yushun Dong• 2026

Related benchmarks

Task	Dataset	Result
Graph Classification	PROTEINS	Accuracy74.73	1383
Image Classification	CIFAR-100	Accuracy79.9	435
Graph Classification	ENZYMES	Accuracy46.94	419
Training Data Provenance Verification	CIFAR10	Avg AUC100	27
Ownership Verification	Model Extraction Setting Surrogate Models	AUC100	24
Image Classification	CIFAR-10	Accuracy94.67	24

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord