Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Contrastive Semantic Projection: Faithful Neuron Labeling with Contrastive Examples

About

Neuron labeling assigns textual descriptions to internal units of deep networks. Existing approaches typically rely on highly activating examples, often yielding broad or misleading labels by focusing on dominant but incidental visual factors. Prior work such as FALCON introduced contrastive examples -- inputs that are semantically similar to activating examples but elicit low activations -- to sharpen explanations, but it primarily addresses subspace-level interpretability rather than scalable neuron-level labeling. We revisit contrastive explanations for neuron-level labeling in two stages: (1) candidate label generation with vision language models (VLMs) and (2) label assignment with CLIP-like encoders. First, we show that providing contrastive image sets to VLMs yields candidate labels that are more specific and more faithful. Second, we introduce Contrastive Semantic Projection (CSP), an extension of SemanticLens that incorporates contrastive examples directly into its CLIP-based scoring and selection pipeline. Across extensive experiments and a case study on melanoma detection, contrastive labeling improves both faithfulness and semantic granularity over state-of-the-art baselines. Our results demonstrate that contrastive examples are a simple yet powerful and currently underutilized component of neuron labeling and analysis pipelines.

Oussama Bouanani, Jim Berend, Wojciech Samek, Sebastian Lapuschkin, Maximilian Dreyer• 2026

Related benchmarks

TaskDatasetResultRank
Neuron LabelingImageNet-1K
DMA62.17
60
Neuron LabelingResNet101 Neurons (evaluated)
AUC91
15
Neuron LabelingResNet50 evaluated neurons
AUC92
15
Neuron LabelingSAE-TopK Evaluated Neurons
AUC0.98
15
Neuron LabelingSAE Vanilla (evaluated neurons)
AUC0.8
15
Neuron LabelingISIC 2019
SCS Score25.33
15
Neuron Labeling FaithfulnessEvaluated Neurons ResNet50 and SAE-TopK
AUC90
15
Neuron LabelingResNet101 neurons
SCS Score23.54
15
Neuron LabelingResNet50 neurons
SCS Score23.4
15
Neuron LabelingSAE-TopK neurons
SCS Score31.91
15
Showing 10 of 11 rows

Other info

Follow for update