Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Neural Basis Models for Interpretability

About

Due to the widespread use of complex machine learning models in real-world applications, it is becoming critical to explain model predictions. However, these models are typically black-box deep neural networks, explained post-hoc via methods with known faithfulness limitations. Generalized Additive Models (GAMs) are an inherently interpretable class of models that address this limitation by learning a non-linear shape function for each feature separately, followed by a linear model on top. However, these models are typically difficult to train, require numerous parameters, and are difficult to scale. We propose an entirely new subfamily of GAMs that utilizes basis decomposition of shape functions. A small number of basis functions are shared among all features, and are learned jointly for a given task, thus making our model scale much better to large-scale data with high-dimensional features, especially when features are sparse. We propose an architecture denoted as the Neural Basis Model (NBM) which uses a single neural network to learn these bases. On a variety of tabular and image datasets, we demonstrate that for interpretable machine learning, NBMs are the state-of-the-art in accuracy, model size, and, throughput and can easily model all higher-order feature interactions. Source code is available at https://github.com/facebookresearch/nbm-spam.

Filip Radenovic, Abhimanyu Dubey, Dhruv Mahajan• 2022

Related benchmarks

TaskDatasetResultRank
ClassificationCUB (test)
Accuracy77.7
79
ClassificationCredit
ROCAUC98.5
63
RegressionCalifornia Housing (CH) (test)--
52
RegressionHousing
RMSE0.478
32
RegressionADNI clinical tabular dataset
Average MSE0.182
27
RegressionCME clinical tabular
Average MSE0.613
27
RegressionAirbnb listings
Average MSE0.441
27
Binary ClassificationMIMIC 2
AUC0.869
25
RegressionYear
MSE79.01
25
Binary ClassificationFICO (test)
AUROC80.48
20
Showing 10 of 31 rows

Other info

Code

Follow for update