Concepts Worth Having: Refining VLM-Guided Concept Bottleneck Models with Minimal Annotations

About

Concept-bottleneck models (CBMs) are neural classifiers that compute predictions from high-level concepts extracted from the input. CBMs ensure stakeholders can understand the concepts -- and the predictions they entail -- by learning these from concept-level annotations, which are however seldom available. Recent CBM architectures work around this issue by obtaining annotations from Vision-Language Models (VLMs). While greatly broadening applicability, doing so can yield lower quality concepts and therefore less interpretable models. We strike for a middle ground by introducing Vision-plus-Human-guided CBM (VH-CBM), a hybrid approach that exploits both VLMs and a small amount of dense annotations. VH-CBM employs a Gaussian Process in the VLM's embedding space, which captures useful global information about the target domain, to propagate the expert's supervision to any target data point. Our empirical evaluation shows how VH-CBM predicts more accurate concepts than VLM-guided CBMs even when annotating as little as 1% of the data, while sporting better concept calibration and supporting active learning.

Nicola Debole, Andrea Passerini, Stefano Teso, Andrea Pugnana, Emanuele Marconato• 2026

Related benchmarks

Task	Dataset	Result
Concept-based Classification	Derma	F1 Score (Y)69	14
Concept-based Classification	CUB	F1 (Target Y)88	14
Concept-based Classification	Shapes3D	F1 (Y)99	14
Concept-based Classification	CelebA	F1 (Y)98	14

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord