Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks

About

In this paper, we propose CLIP-Dissect, a new technique to automatically describe the function of individual hidden neurons inside vision networks. CLIP-Dissect leverages recent advances in multimodal vision/language models to label internal neurons with open-ended concepts without the need for any labeled data or human examples. We show that CLIP-Dissect provides more accurate descriptions than existing methods for last layer neurons where the ground-truth is available as well as qualitatively good descriptions for hidden layer neurons. In addition, our method is very flexible: it is model agnostic, can easily handle new concepts and can be extended to take advantage of better multimodal models in the future. Finally CLIP-Dissect is computationally efficient and can label all neurons from five layers of ResNet-50 in just 4 minutes, which is more than 10 times faster than existing methods. Our code is available at https://github.com/Trustworthy-ML-Lab/CLIP-dissect. Finally, crowdsourced user study results are available at Appendix B to further support the effectiveness of our method.

Tuomas Oikarinen, Tsui-Wei Weng• 2022

Related benchmarks

TaskDatasetResultRank
Neuron LabelingImageNet-1K
DMA57.78
60
Neuron InterpretationImageNet-1k (val)
CLIP Cosine Similarity0.7942
18
Neuron descriptionImageNet
AUC95
15
Neuron LabelingISIC 2019
SCS Score22.88
15
Neuron LabelingResNet50 evaluated neurons
AUC89
15
Neuron LabelingSAE Vanilla (evaluated neurons)
AUC0.78
15
Neuron LabelingResNet50 neurons
SCS Score22.78
15
Neuron Labeling FaithfulnessEvaluated Neurons ResNet50 and SAE-TopK
AUC88
15
Neuron LabelingResNet101 Neurons (evaluated)
AUC88
15
Neuron LabelingResNet101 neurons
SCS Score22.84
15
Showing 10 of 31 rows

Other info

Follow for update