Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models

About

Modern deep neural networks have now reached human-level performance across a variety of tasks. However, unlike humans they lack the ability to explain their decisions by showing where and telling what concepts guided them. In this work, we present a unified framework for transforming any vision neural network into a spatially and conceptually interpretable model. We introduce a spatially-aware concept bottleneck layer that projects "black-box" features of pre-trained backbone models into interpretable concept maps, without requiring human labels. By training a classification layer over this bottleneck, we obtain a self-explaining model that articulates which concepts most influenced its prediction, along with heatmaps that ground them in the input image. Accordingly, we name this method "Spatially-Aware and Label-Free Concept Bottleneck Model" (SALF-CBM). Our results show that the proposed SALF-CBM: (1) Outperforms non-spatial CBM methods, as well as the original backbone, on a variety of classification tasks; (2) Produces high-quality spatial explanations, outperforming widely used heatmap-based methods on a zero-shot segmentation task; (3) Facilitates model exploration and debugging, enabling users to query specific image regions and refine the model's decisions by locally editing its concept maps.

Itay Benou, Tammy Riklin-Raviv • 2025
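
The pipeline described in the abstract (backbone features projected into spatial concept maps, followed by a classifier over the bottleneck) can be illustrated with a toy sketch in plain Python. This is not the authors' implementation. The linear per-location projection, the global average pooling, the linear classifier, and all names (`concept_maps`, `pool`, `classify`, `proj`, `weights`) are assumptions chosen for illustration, and the numbers are made up.

```python
# Illustrative sketch only. The linear per-location projection and the
# average pooling are assumptions, not the SALF-CBM implementation.

def concept_maps(features, proj):
    """Project backbone features (C x H x W) to concept maps (K x H x W).

    proj is a K x C matrix applied independently at every spatial
    location, which is equivalent to a 1x1 convolution.
    """
    C = len(features)
    H, W = len(features[0]), len(features[0][0])
    return [
        [[sum(row[c] * features[c][y][x] for c in range(C)) for x in range(W)]
         for y in range(H)]
        for row in proj
    ]

def pool(maps):
    """Global average pooling: one scalar activation per concept."""
    return [sum(sum(r) for r in m) / (len(m) * len(m[0])) for m in maps]

def classify(activations, weights, bias):
    """Linear classifier over concept activations.

    Returns (logits, contributions), where contributions[k][j] is the
    part of class k's logit that comes from concept j.
    """
    contributions = [[w * a for w, a in zip(row, activations)] for row in weights]
    logits = [sum(c) + b for c, b in zip(contributions, bias)]
    return logits, contributions

# Toy input: 2 feature channels on a 2x2 grid, 2 concepts, 2 classes.
features = [
    [[1.0, 0.0], [0.0, 0.0]],  # channel 0 fires in the top-left cell
    [[0.0, 0.0], [0.0, 2.0]],  # channel 1 fires in the bottom-right cell
]
proj = [[1.0, 0.0], [0.0, 1.0]]     # concept j reads channel j
weights = [[1.0, 0.0], [0.0, 1.0]]  # class k depends on concept k
bias = [0.0, 0.0]

maps = concept_maps(features, proj)
acts = pool(maps)
logits, contrib = classify(acts, weights, bias)

# "Tell": the concept that contributes most to the predicted class.
pred = max(range(len(logits)), key=logits.__getitem__)
top_concept = max(range(len(acts)), key=lambda j: contrib[pred][j])
# "Show": that concept's spatial map serves as the heatmap.
heatmap = maps[top_concept]
```

Because the classifier is linear over concept activations, each logit decomposes into per-concept contributions, and each concept keeps a spatial map that can be displayed as a heatmap. Under the same assumptions, editing a region of `maps` and re-running `pool` and `classify` would mimic the local concept-map editing mentioned in the abstract.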

Related benchmarks

Task	Dataset	Metric	Result	
Image Classification	ImageNet	--	--	431
Image Classification	ImageNet (test)	Top-1 Accuracy	76.3	299
Image Classification	CUB-200	Accuracy	76.21	117
Image Classification	Places365	Top-1 Accuracy	49.38	27
Image Classification	Places365 (test)	Accuracy	49.4	9
Segmentation	ImageNet (val)	Pixel Accuracy	76.94	7
Concept interpretability	COCO-Stuff (test)	Locality	19.5	5
Concept interpretability	PartImageNet (test)	Locality	16.4	5

