ILRR: Inference-Time Steering Method for Masked Diffusion Language Models

About

Discrete Diffusion Language Models (DLMs) offer a promising non-autoregressive alternative for text generation, yet effective mechanisms for inference-time control remain relatively underexplored. Existing approaches include sampling-level guidance procedures or trajectory optimization mechanisms. In this work, we introduce Iterative Latent Representation Refinement (ILRR), a learning-free framework for steering DLMs using a single reference sequence. ILRR guides generation by dynamically aligning the internal activations of the generated sequence with those of a given reference throughout the denoising process. This approach captures and transfers high-level semantic properties, with a tunable steering scale enabling flexible control over attributes such as sentiment. We further introduce Spatially Modulated Steering, an extension that enables steering long texts using shorter references by regulating guidance intensity across the sequence. Empirically, we demonstrate that ILRR achieves effective attribute steering on LLaDA and MDLM architectures with a minor computational overhead, requiring only one additional parallel forward pass per denoising step. Under the same compute budget, ILRR improves attribute accuracy over comparable baselines by 10$\%$ to 60$\%$ points, while maintaining high generation quality.

Eden Avrahami, Eliya Nachmani• 2026

Related benchmarks

Task	Dataset	Result
Sentiment Steering	15 prefix prompts length 50	Sentiment Accuracy100	11
Toxicity Steering	15 prefix prompts length 50	Toxicity Accuracy71.2	11
Sentiment Steering	MDLM long sequence generation 512 length (test)	Steering Accuracy61.7	6
Toxicity Steering	MDLM long sequence generation 512 length (test)	Steering Accuracy13.1	6

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord