Local MDI+: Local Feature Importances for Tree-Based Models

About

Tree-based ensembles such as random forests remain the go-to for tabular data over deep learning models due to their prediction performance and computational efficiency. These advantages have led to their widespread deployment in high-stakes domains, where interpretability is essential for ensuring trustworthy predictions. This has motivated the development of popular local feature importance methods such as LIME and TreeSHAP. However, these approaches rely on approximations that ignore the model's internal structure and instead depend on potentially unstable perturbations. These issues are addressed in the global setting by MDI+, a global feature importance method which combines tree-based and linear feature importances by exploiting an equivalence between decision trees and least squares on a transformed node basis. However, the global MDI+ scores are not able to explain predictions when faced with heterogeneous individual characteristics. To address this gap, we propose Local MDI+ (LMDI+), a novel extension of the MDI+ framework that quantifies feature importances for each particular sample. Across twelve real-world benchmark datasets, LMDI+ outperforms existing baselines at identifying instance-specific predictive features, yielding an average 10% improvement in predictive performance when using only the selected features. It further demonstrates greater stability by consistently producing similar instance-level feature importance rankings across repeated model fits with different random seeds. Ablation experiments show that each component of LMDI+ contributes to these gains, and that the improvements extend beyond random forests to gradient boosting models. Finally, we show that LMDI+ enables local interpretability use cases by identifying closely matched counterfactuals for each classification benchmark and discovering homogeneous subgroups in a housing dataset case study.
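The stability claim above (similar instance-level rankings across refits with different random seeds) can be made concrete with a small sketch. The abstract does not say which stability metric is used, so the mean pairwise top-k Jaccard overlap below is an assumption chosen for illustration. The function names (`top_k_features`, `jaccard`, `ranking_stability`) and the scores are hypothetical and are not taken from the LMDI+ code or results.

```python
from itertools import combinations


def top_k_features(importances, k):
    """Indices of the k features with the largest absolute importance."""
    ranked = sorted(
        range(len(importances)),
        key=lambda j: abs(importances[j]),
        reverse=True,
    )
    return set(ranked[:k])


def jaccard(a, b):
    """Jaccard similarity of two sets; defined as 1.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def ranking_stability(scores_per_fit, k):
    """Mean pairwise top-k Jaccard overlap, averaged over samples.

    scores_per_fit[f][i][j] is the local importance of feature j for
    sample i under model fit f. Requires at least two fits.
    """
    pairs = list(combinations(range(len(scores_per_fit)), 2))
    n_samples = len(scores_per_fit[0])
    total = 0.0
    for i in range(n_samples):
        # Top-k feature set for sample i under each fit.
        tops = [top_k_features(fit[i], k) for fit in scores_per_fit]
        total += sum(jaccard(tops[a], tops[b]) for a, b in pairs) / len(pairs)
    return total / n_samples


# Made-up local importance scores: 3 fits (seeds), 2 samples, 4 features.
scores = [
    [[0.9, 0.1, 0.5, 0.0], [0.2, 0.8, 0.1, 0.7]],
    [[0.8, 0.2, 0.6, 0.1], [0.1, 0.9, 0.3, 0.6]],
    [[0.7, 0.6, 0.1, 0.0], [0.3, 0.7, 0.0, 0.8]],
]
stability = ranking_stability(scores, k=2)
print(round(stability, 4))
```

A value of 1.0 would mean every fit selects the same top-k features for every sample. Lower values indicate that the per-sample rankings change with the random seed.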

Zhongyuan Liang, Zachary T. Rewolinski, Abhineet Agarwal, Tiffany M. Tang, Bin Yu • 2025

Related benchmarks

Task	Dataset	Result
Local Feature Importance Evaluation	12 Real-World Datasets Aggregate (test)	Average Rank 1	16
Counterfactual Explanations	Spam (test)	--	16
Classification	Pol N=10,082 (full)	AUROC 0.9959	9
Classification	House 16H (full)	AUROC 93.1	9
Classification	Miami Housing N=13,932 (full)	AUROC 0.8603	9
Regression	Super Conductivity N=21,263 (full)	R2 Score 0.8268	9
Regression	SARCOS N=48,933 (full)	R2 Score 0.9643	9
Regression	Wave Energy N=72,000 (full)	R2 Score 0.8521	9
Counterfactual Explanation	House 16H (test)	Mean L1 Distance 7.6	4
Counterfactual Explanation	Higgs (test)	Mean L1 Distance 18.2	4

Showing 10 of 13 rows
