Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Variational Learning for Insertion-based Generation

About

Non-monotonic sequence generation methods, such as masked diffusion models, provide a flexible alternative to left-to-right autoregressive modeling by allowing tokens to be generated in non-fixed and prescribed orders. Despite their practical advantages, most existing non-monotonic models are order-agnostic and rely on a fixed-length grid, limiting their ability to support variable-length generation and adaptive insertion order. In this work, we introduce a probabilistic framework for learning insertion order in variable-length insertion models. We formalize a bijective correspondence between insertion trajectories and permutations, which enables an exact reparameterization of the data likelihood as a sum over permutations. Building on this result, we propose the Insertion Process (IP), a stochastic generative model that jointly learns where to insert, what to insert, and when to terminate, trained via permutation-based variational inference. Unlike prior fixed-canvas approaches, IP natively supports variable-length generation and learns data-driven preferences over insertion orders. Experiments on goal-conditioned planning and molecular string generation demonstrate that learning insertion order improves both modeling quality and generalization in domains without a canonical left-to-right structure.

Yangtian Zhang, Zhe Wang, Arthur Gretton, Rex Ying, David van Dijk, Michalis K. Titsias, Jiaxin Shi• 2026

Related benchmarks

TaskDatasetResultRank
Unconditional Molecule GenerationGuacaMol SMILES (test)
Validity97.4
10
Maze PlanningBraided Maze Planning Easy
Accuracy100
6
Maze PlanningBraided Maze Planning Medium
Accuracy1
6
Maze PlanningBraided Maze Planning Hard
Accuracy99.9
6
Maze PlanningImperfect Maze Planning Easy
Accuracy100
6
Maze PlanningImperfect Maze Planning Medium
Accuracy98.4
6
Maze PlanningImperfect Maze Planning Hard
Accuracy95.3
6
Maze PlanningPerfect Maze Planning Easy
Accuracy99.4
6
Maze PlanningPerfect Maze Planning Medium
Accuracy98.1
6
Maze PlanningPerfect Maze Planning Hard
Accuracy97.9
6
Showing 10 of 11 rows

Other info

Follow for update