Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

IPGuard: Protecting Intellectual Property of Deep Neural Networks via Fingerprinting the Classification Boundary

About

A deep neural network (DNN) classifier represents a model owner's intellectual property as training a DNN classifier often requires lots of resource. Watermarking was recently proposed to protect the intellectual property of DNN classifiers. However, watermarking suffers from a key limitation: it sacrifices the utility/accuracy of the model owner's classifier because it tampers the classifier's training or fine-tuning process. In this work, we propose IPGuard, the first method to protect intellectual property of DNN classifiers that provably incurs no accuracy loss for the classifiers. Our key observation is that a DNN classifier can be uniquely represented by its classification boundary. Based on this observation, IPGuard extracts some data points near the classification boundary of the model owner's classifier and uses them to fingerprint the classifier. A DNN classifier is said to be a pirated version of the model owner's classifier if they predict the same labels for most fingerprinting data points. IPGuard is qualitatively different from watermarking. Specifically, IPGuard extracts fingerprinting data points near the classification boundary of a classifier that is already trained, while watermarking embeds watermarks into a classifier during its training or fine-tuning process. We extensively evaluate IPGuard on CIFAR-10, CIFAR-100, and ImageNet datasets. Our results show that IPGuard can robustly identify post-processed versions of the model owner's classifier as pirated versions of the classifier, and IPGuard can identify classifiers, which are not the model owner's classifier nor its post-processed versions, as non-pirated versions of the classifier.

Xiaoyu Cao, Jinyuan Jia, Neil Zhenqiang Gong• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100
Accuracy77.95
109
Training Data Provenance VerificationCIFAR10
Avg AUC67.9
27
Image ClassificationCIFAR-10
Accuracy92.98
24
Ownership VerificationModel Extraction Setting Surrogate Models
AUC67.9
24
Model Lineage AttestationCaltech101 -> Caltech101 -> CIFAR100
TPR20
20
Model Lineage AttestationCIFAR100 Dogs
TPR16
10
Model Lineage AttestationCaltech101->TinyImageNet Mixed
TPR18
10
Model Lineage AttestationTinyImageNet->Pet->Mixed
TPR14
10
Model Lineage AttestationFlowers CIFAR100 Mixed Dataset
TPR0.13
10
Showing 9 of 9 rows

Other info

Follow for update