Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ILRR: Inference-Time Steering Method for Masked Diffusion Language Models

About

Discrete Diffusion Language Models (DLMs) offer a promising non-autoregressive alternative for text generation, yet effective mechanisms for inference-time control remain relatively underexplored. Existing approaches include sampling-level guidance procedures or trajectory optimization mechanisms. In this work, we introduce Iterative Latent Representation Refinement (ILRR), a learning-free framework for steering DLMs using a single reference sequence. ILRR guides generation by dynamically aligning the internal activations of the generated sequence with those of a given reference throughout the denoising process. This approach captures and transfers high-level semantic properties, with a tunable steering scale enabling flexible control over attributes such as sentiment. We further introduce Spatially Modulated Steering, an extension that enables steering long texts using shorter references by regulating guidance intensity across the sequence. Empirically, we demonstrate that ILRR achieves effective attribute steering on LLaDA and MDLM architectures with a minor computational overhead, requiring only one additional parallel forward pass per denoising step. Under the same compute budget, ILRR improves attribute accuracy over comparable baselines by 10$\%$ to 60$\%$ points, while maintaining high generation quality.

Eden Avrahami, Eliya Nachmani• 2026

Related benchmarks

TaskDatasetResultRank
Sentiment Steering15 prefix prompts length 50
Sentiment Accuracy100
11
Toxicity Steering15 prefix prompts length 50
Toxicity Accuracy71.2
11
Sentiment SteeringMDLM long sequence generation 512 length (test)
Steering Accuracy61.7
6
Toxicity SteeringMDLM long sequence generation 512 length (test)
Steering Accuracy13.1
6
Showing 4 of 4 rows

Other info

Follow for update