Accumulated Aggregated D-Optimal Designs for Estimating Main Effects in Black-Box Models
About
Estimating how individual input variables affect the output of a black-box model is a central task in explainable machine learning. However, existing methods suffer from two key limitations: sensitivity to out-of-distribution (OOD) evaluations, which arises when query points are placed far from the data manifold, and instability under feature correlation, which can make effect estimates unreliable in practice. We introduce a unified view of main effect estimation as a design problem, which reveals that existing methods differ only in their choice of evaluation locations. Building on this formulation, we propose A2D2E, an estimator based on Accumulated Aggregated D-Optimal Designs, which replaces the usual evaluation locations with a D-optimal hypercube design to minimize the variance of main effect estimation. A2D2E is model-agnostic, requires no differentiability of the predictor, and admits a closed-form estimator with complexity comparable to existing approaches. We establish that A2D2E is consistent for the same population target as ALE, and extend this result to the realistic setting where only a surrogate model is available. Through extensive simulations across multiple predictive models and dependence settings, we demonstrate that A2D2E outperforms ALE-based methods, with the largest gains under high feature correlation.
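For readers unfamiliar with the ALE baseline that the abstract compares against, the following is a minimal sketch of a first-order accumulated local effects estimate for one feature. It is not A2D2E and not the authors' code: the function name `ale_main_effect`, the quantile binning, and the centering rule are illustrative assumptions. It shows the point the abstract makes about design, namely that the estimate is determined by where the model is evaluated (here, at the two edges of each bin).

```python
import numpy as np

def ale_main_effect(predict, X, j, n_bins=10):
    """Illustrative first-order ALE estimate for feature j (hypothetical helper)."""
    x = X[:, j]
    # Quantile-based edges so each bin holds roughly the same number of points.
    edges = np.unique(np.quantile(x, np.linspace(0.0, 1.0, n_bins + 1)))
    k = len(edges) - 1
    # Map each observation to a bin index in 0..k-1.
    idx = np.clip(np.searchsorted(edges, x, side="right") - 1, 0, k - 1)
    local = np.zeros(k)
    counts = np.zeros(k)
    for b in range(k):
        mask = idx == b
        if not mask.any():
            continue
        # Evaluate the model at the lower and upper bin edge, holding the
        # other features at their observed values.
        lo, hi = X[mask].copy(), X[mask].copy()
        lo[:, j] = edges[b]
        hi[:, j] = edges[b + 1]
        local[b] = np.mean(predict(hi) - predict(lo))
        counts[b] = mask.sum()
    # Accumulate the local differences, then center the curve so its
    # count-weighted mean over bin midpoints is zero.
    ale = np.concatenate([[0.0], np.cumsum(local)])
    mid = 0.5 * (ale[:-1] + ale[1:])
    ale -= np.sum(mid * counts) / counts.sum()
    return edges, ale

# Usage on a toy additive function (an assumption made for illustration).
rng = np.random.default_rng(0)
X = rng.uniform(size=(500, 3))
f = lambda Z: 2.0 * Z[:, 0] + Z[:, 1] ** 2
edges, ale = ale_main_effect(f, X, j=0, n_bins=10)
```

For this toy function the estimated curve for feature 0 increases linearly with slope 2 across the bin edges, as expected for an additive linear term.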
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| All-variables main effect estimation | Synthetic Unif[0, 1]^p n=500 (train) | Mean Wall-Clock Time (s) | 0.071 | 15 |
| Single-variable main effect estimation | Synthetic Unif[0, 1]^p (p=4) (train) | Mean Wall-Clock Time (s) | 0.048 | 15 |
| K-NN Regression | Additive benchmark functions Low dependence | Mean ORMSE (f0) | 0.0834 | 5 |
| K-NN Regression | Additive benchmark functions High dependence | Mean ORMSE (f0) | 0.0907 | 5 |
| K-NN Regression | Additive benchmark functions Independent dependence | Mean ORMSE (f0) | 0.0886 | 5 |
| Main-effect function estimation | branin Low dependence Synthetic | ORMSE | 0.434 | 3 |
| Main-effect function estimation | branin High dependence Synthetic | ORMSE | 0.732 | 3 |
| Main-effect function estimation | simple Low dependence Synthetic | ORMSE | 0.062 | 3 |
| Main-effect function estimation | simple High dependence Synthetic | ORMSE | 0.172 | 3 |
| Main-effect function estimation | franke Low dependence Synthetic | ORMSE | 0.114 | 3 |