
Concepts Worth Having: Refining VLM-Guided Concept Bottleneck Models with Minimal Annotations

About

Concept-bottleneck models (CBMs) are neural classifiers that compute predictions from high-level concepts extracted from the input. CBMs ensure stakeholders can understand the concepts, and the predictions they entail, by learning them from concept-level annotations, which are, however, seldom available. Recent CBM architectures work around this issue by obtaining annotations from Vision-Language Models (VLMs). While this greatly broadens applicability, it can yield lower-quality concepts and therefore less interpretable models. We strike a middle ground by introducing Vision-plus-Human-guided CBM (VH-CBM), a hybrid approach that exploits both VLMs and a small amount of dense annotations. VH-CBM employs a Gaussian Process in the VLM's embedding space, which captures useful global information about the target domain, to propagate the expert's supervision to any target data point. Our empirical evaluation shows that VH-CBM predicts more accurate concepts than VLM-guided CBMs even when annotating as little as 1% of the data, while offering better concept calibration and supporting active learning.
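To make the label-propagation idea concrete, here is a minimal sketch of Gaussian Process regression that spreads a handful of concept annotations to unlabeled points in an embedding space. It is an illustration under stated assumptions, not the authors' implementation: the 2-D points stand in for VLM embeddings, the RBF kernel, the noise level, the ±1 label encoding (concept absent/present), and all function names are hypothetical choices made for this example, and plain GP regression is used in place of whatever likelihood VH-CBM actually adopts.

```python
import math

def rbf_kernel(x, y, length_scale=1.0):
    """Squared-exponential kernel between two embedding vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-0.5 * sq_dist / length_scale ** 2)

def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(A[i][i] - s)
            else:
                L[i][j] = (A[i][j] - s) / L[j][j]
    return L

def solve_cholesky(L, b):
    """Solve (L L^T) x = b by forward, then backward substitution."""
    n = len(L)
    y = [0.0] * n
    for i in range(n):
        y[i] = (b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

def gp_posterior(train_X, train_y, test_X, length_scale=1.0, noise=1e-2):
    """Posterior mean and variance of GP regression at each test embedding."""
    n = len(train_X)
    K = [
        [rbf_kernel(train_X[i], train_X[j], length_scale) + (noise if i == j else 0.0)
         for j in range(n)]
        for i in range(n)
    ]
    L = cholesky(K)
    alpha = solve_cholesky(L, train_y)  # alpha = (K + noise * I)^{-1} y
    means, variances = [], []
    for x in test_X:
        k_star = [rbf_kernel(x, xi, length_scale) for xi in train_X]
        v = solve_cholesky(L, k_star)
        means.append(sum(k * a for k, a in zip(k_star, alpha)))
        variances.append(rbf_kernel(x, x, length_scale)
                         - sum(k * vi for k, vi in zip(k_star, v)))
    return means, variances

# Toy stand-ins for embeddings (assumption: real VLM embeddings are high-dimensional).
labeled = [(-2.1, 0.2), (-1.8, -0.1), (2.0, 0.3), (2.2, -0.2)]
labels = [-1.0, -1.0, 1.0, 1.0]  # -1 = concept absent, +1 = concept present
unlabeled = [(-2.0, 0.0), (1.9, 0.1), (0.0, 6.0)]

means, variances = gp_posterior(labeled, labels, unlabeled)
for point, m, v in zip(unlabeled, means, variances):
    print(point, "mean:", round(m, 3), "variance:", round(v, 3))
```

Points close to annotated examples inherit their concept label with low posterior variance, while the far-away point keeps a mean near zero and a variance near the prior. That variance is the kind of signal an active-learning loop could use to decide which example to send to the annotator next.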

Nicola Debole, Andrea Passerini, Stefano Teso, Andrea Pugnana, Emanuele Marconato • 2026

Related benchmarks

Task                          Dataset   Result              Rank
Concept-based Classification  Derma     F1 Score (Y): 69    14
Concept-based Classification  CUB       F1 (Target Y): 88   14
Concept-based Classification  Shapes3D  F1 (Y): 99          14
Concept-based Classification  CelebA    F1 (Y): 98          14
