From Heads to Neurons: Causal Attribution and Steering in Multi-Task Vision-Language Models

About

Recent work has increasingly explored neuron-level interpretation in vision-language models (VLMs) to identify neurons critical to final predictions. However, existing neuron analyses generally focus on single tasks, limiting the comparability of neuron importance across tasks. Moreover, ranking strategies tend to score neurons in isolation, overlooking how task-dependent information pathways shape the write-in effects of feed-forward network (FFN) neurons. This oversight can exacerbate neuron polysemanticity in multi-task settings, introducing noise into both the identification of task-critical neurons and interventions on them. In this study, we propose HONES (Head-Oriented Neuron Explanation & Steering), a gradient-free framework for task-aware neuron attribution and steering in multi-task VLMs. HONES ranks FFN neurons by their causal write-in contributions conditioned on task-relevant attention heads, and further modulates salient neurons via lightweight scaling. Experiments on four diverse multimodal tasks and two popular VLMs show that HONES outperforms existing methods in identifying task-critical neurons and improves model performance after steering. Our source code is released at: https://github.com/petergit1/HONES.
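
To make the idea of modulating neurons "via lightweight scaling" concrete, here is a minimal, generic sketch of activation scaling in a toy FFN. It is not the HONES implementation: the function `ffn_forward`, the `scale` argument, the weights, and the choice of which neuron to scale are all hypothetical, and the head-conditioned ranking step described in the abstract is not modeled at all.

```python
def relu(values):
    return [max(0.0, v) for v in values]


def ffn_forward(x, w_in, w_out, scale=None):
    """Toy FFN: hidden neuron i writes acts[i] * w_out[i] into the output.

    `scale` is a hypothetical steering knob mapping a neuron index to a
    multiplier applied to that neuron's activation (default 1.0).
    """
    scale = scale or {}
    # Pre-activations: one dot product per hidden neuron.
    acts = relu([sum(w * xi for w, xi in zip(row, x)) for row in w_in])
    # Steering: rescale only the selected neurons.
    acts = [a * scale.get(i, 1.0) for i, a in enumerate(acts)]
    d_out = len(w_out[0])
    # Sum each neuron's write-in contribution along every output dimension.
    return [
        sum(acts[i] * w_out[i][j] for i in range(len(acts)))
        for j in range(d_out)
    ]


# Illustrative toy weights: 2 inputs, 3 hidden neurons, 2 outputs.
x = [1.0, 2.0]
w_in = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
w_out = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

baseline = ffn_forward(x, w_in, w_out)
steered = ffn_forward(x, w_in, w_out, scale={1: 1.5})  # amplify neuron 1
```

In this toy setup only the output dimension that neuron 1 writes into changes, which is the sense in which scaling a single neuron is a targeted intervention.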

Qidong Wang, Junjie Hu, Ming Jiang • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Visual Question Answering | GQA | Accuracy 64.1 | 1425 |
| Visual Question Answering | VQA | Accuracy 69.07 | 66 |
| Text-based Visual Question Answering | TextVQA | ANLS 60.7 | 33 |
| Vision-Language Multi-task Evaluation | MS COCO Unified Multi-task (test) | VQA Score 36.5 | 24 |
| Image-Text Retrieval | Retrieval | Avg Recall 64.03 | 23 |
| Optical Character Recognition | OCR | Average Score 64.03 | 20 |
| Visual Question Answering | GQA OOD (test) | -- | 20 |
| Image Captioning | Caption | BLEU-4 23.35 | 12 |
| Visual Question Answering | VQAv2 3K curated MS COCO (test) | Relative Performance Drop (%) 27.3 | 10 |
| Image Captioning | Flickr30K | BLEU-4 16.9 | 8 |
Showing 10 of 20 rows