Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models

About

Modern deep neural networks have now reached human-level performance across a variety of tasks. However, unlike humans they lack the ability to explain their decisions by showing where and telling what concepts guided them. In this work, we present a unified framework for transforming any vision neural network into a spatially and conceptually interpretable model. We introduce a spatially-aware concept bottleneck layer that projects "black-box" features of pre-trained backbone models into interpretable concept maps, without requiring human labels. By training a classification layer over this bottleneck, we obtain a self-explaining model that articulates which concepts most influenced its prediction, along with heatmaps that ground them in the input image. Accordingly, we name this method "Spatially-Aware and Label-Free Concept Bottleneck Model" (SALF-CBM). Our results show that the proposed SALF-CBM: (1) Outperforms non-spatial CBM methods, as well as the original backbone, on a variety of classification tasks; (2) Produces high-quality spatial explanations, outperforming widely used heatmap-based methods on a zero-shot segmentation task; (3) Facilitates model exploration and debugging, enabling users to query specific image regions and refine the model's decisions by locally editing its concept maps.

Itay Benou, Tammy Riklin-Raviv• 2025

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet--
429
Image ClassificationImageNet (test)
Top-1 Accuracy76.3
291
Image ClassificationCUB-200
Accuracy76.21
92
Image ClassificationPlaces 365
Top-1 Acc49.38
17
Image ClassificationPlaces365 (test)
Accuracy49.4
9
SegmentationImageNet (val)
Pixel Accuracy76.94
7
Concept interpretabilityCoco-Stuff (test)
Locality19.5
5
Concept interpretabilityPartImageNet (test)
Locality16.4
5
Showing 8 of 8 rows

Other info

Code

Follow for update