Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models

About

While plan-and-infill decoding in Masked Diffusion Models (MDMs) shows promise for mathematical and code reasoning, performance remains highly sensitive to slot infilling order, often yielding substantial output variance. We introduce McDiffuSE, a framework that formulates slot selection as decision making and optimises infilling orders through Monte Carlo Tree Search (MCTS). McDiffuSE uses look-ahead simulations to evaluate partial completions before commitment, systematically exploring the combinatorial space of generation orders. Experiments show an average improvement of 3.2% over autoregressive baselines and 8.0% over baseline plan-and-infill, with notable gains of 19.5% on MBPP and 4.9% on MATH500. Our analysis reveals that while McDiffuSE predominantly follows sequential ordering, incorporating non-sequential generation is essential for maximising performance. We observe that larger exploration constants, rather than increased simulations, are necessary to overcome model confidence biases and discover effective orderings. These findings establish MCTS-based planning as an effective approach for enhancing generation quality in MDMs.

Joshua Ong Jun Leang, Yu Zhao, Mihaela C\u{a}t\u{a}lina Stoian, Wenda Li, Shay B. Cohen, Eleonora Giunchiglia• 2026

Related benchmarks

TaskDatasetResultRank
Code GenerationHumanEval
Pass@178.37
850
Mathematical ReasoningGSM8K (test)
Accuracy90.63
797
Mathematical ReasoningMATH500 (test)
Accuracy54.6
381
Mathematical ReasoningGSM8K--
351
Code GenerationMBPP
Pass@173.57
113
Question AnsweringGPQA (test)
Accuracy37.19
55
Question AnsweringGPQA Diamond
Pass@134.78
49
Question AnsweringARC
pass@188.68
30
Showing 8 of 8 rows

Other info

Follow for update