Measuring the Effect of Training Data on Deep Learning Predictions via Randomized Experiments

About

We develop a new, principled algorithm for estimating the contribution of training data points to the behavior of a deep learning model, such as a specific prediction it makes. Our algorithm estimates the AME, a quantity that measures the expected (average) marginal effect of adding a data point to a subset of the training data, sampled from a given distribution. When subsets are sampled from the uniform distribution, the AME reduces to the well-known Shapley value. Our approach is inspired by causal inference and randomized experiments: we sample different subsets of the training data to train multiple submodels, and evaluate each submodel's behavior. We then use a LASSO regression to jointly estimate the AME of each data point, based on the subset compositions. Under sparsity assumptions ($k \ll N$ datapoints have large AME), our estimator requires only $O(k\log N)$ randomized submodel trainings, improving upon the best prior Shapley value estimators.

Jinkun Lin, Anqi Zhang, Mathias Lecuyer, Jinyang Li, Aurojit Panda, Siddhartha Sen• 2022

Related benchmarks

Task	Dataset	Result
Classification	DIGITS (test)	Average Accuracy66.2	65
Corruption Detection (Feature-noise)	ImageNet100 (test)	F1-score18	56
Classification	Electricity (test)	Accuracy63.2	55
Noisy label detection	ImageNet-100	F1 Score58	54
Protected-attribute detection-gap evaluation	Adult	L^TPR5.5	14
Classification	election (test)	Mean Test Accuracy58.6	10
Classification	bbc-embeddings (test)	Mean Test Accuracy80.3	10
Classification	MINIBOONE (test)	Mean Test Accuracy69.1	10
Classification	2dplanes (test)	Mean Test Accuracy72.4	10
Classification	nomao (test)	Mean Test Accuracy77.8	10

Showing 10 of 19 rows

Other info

Follow for update

@wizwand_team Discord