Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Spotlight Your Instructions: Instruction-following with Dynamic Attention Steering

About

In many real-world applications, users rely on natural language instructions to guide large language models (LLMs) across a wide range of tasks. These instructions are often complex, diverse, and subject to frequent change. However, LLMs do not always attend to these instructions reliably, and users lack simple mechanisms to emphasize their importance beyond modifying prompt wording or structure. To address this, we present an inference-time method that enables users to emphasize specific parts of their prompt by steering the model's attention toward them, aligning the model's perceived importance of different prompt tokens with user intent. Unlike prior approaches that are limited to static instructions, require significant offline profiling, or rely on fixed biases, we dynamically update the proportion of model attention given to the user-specified parts--ensuring improved instruction following without performance degradation. We demonstrate that our approach improves instruction following across a variety of tasks involving multiple instructions and generalizes across models of varying scales.

Praveen Venkateswaran, Danish Contractor• 2025

Related benchmarks

TaskDatasetResultRank
Instruction FollowingIFEval--
625
SteeringEmotion
Steering Success92.7
11
SteeringAI Persona
Steering Success79
11
SteeringToxicity
Steering Success58
11
SteeringQA
Steering Success58
11
SteeringJailbreak
Steering Success2.5
11
Mathematical ReasoningGSM8K-Format
Final Accuracy98.8
9
Long-context Instruction FollowingLIFBench
List Score61.4
9
Showing 8 of 8 rows

Other info

Follow for update