Measuring the Effect of Training Data on Deep Learning Predictions via Randomized Experiments
About
We develop a new, principled algorithm for estimating the contribution of training data points to the behavior of a deep learning model, such as a specific prediction it makes. Our algorithm estimates the AME, a quantity that measures the expected (average) marginal effect of adding a data point to a subset of the training data, sampled from a given distribution. When subsets are sampled from the uniform distribution, the AME reduces to the well-known Shapley value. Our approach is inspired by causal inference and randomized experiments: we sample different subsets of the training data to train multiple submodels, and evaluate each submodel's behavior. We then use a LASSO regression to jointly estimate the AME of each data point, based on the subset compositions. Under sparsity assumptions ($k \ll N$ datapoints have large AME), our estimator requires only $O(k\log N)$ randomized submodel trainings, improving upon the best prior Shapley value estimators.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Classification | DIGITS (test) | Average Accuracy66.2 | 59 | |
| Classification | Electricity (test) | Accuracy63.2 | 55 | |
| Classification | election (test) | Mean Test Accuracy58.6 | 10 | |
| Classification | bbc-embeddings (test) | Mean Test Accuracy80.3 | 10 | |
| Classification | MINIBOONE (test) | Mean Test Accuracy69.1 | 10 | |
| Classification | 2dplanes (test) | Mean Test Accuracy72.4 | 10 | |
| Classification | nomao (test) | Mean Test Accuracy77.8 | 10 | |
| Classification | fried (test) | Mean Test Accuracy70.2 | 10 | |
| Data Selection | DIGITS (test) | -- | 1 |