Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning-Order Autoregressive Models with Application to Molecular Graph Generation

About

Autoregressive models (ARMs) have become the workhorse for sequence generation tasks, since many problems can be modeled as next-token prediction. While there appears to be a natural ordering for text (i.e., left-to-right), for many data types, such as graphs, the canonical ordering is less obvious. To address this problem, we introduce a variant of ARM that generates high-dimensional data using a probabilistic ordering that is sequentially inferred from data. This model incorporates a trainable probability distribution, referred to as an order-policy, that dynamically decides the autoregressive order in a state-dependent manner. To train the model, we introduce a variational lower bound on the log-likelihood, which we optimize with stochastic gradient estimation. We demonstrate experimentally that our method can learn meaningful autoregressive orderings in image and graph generation. On the challenging domain of molecular graph generation, we achieve state-of-the-art results on the QM9 and ZINC250k benchmarks, evaluated across key metrics for distribution similarity and drug-likeless.

Zhe Wang, Jiaxin Shi, Nicolas Heess, Arthur Gretton, Michalis K. Titsias• 2025

Related benchmarks

TaskDatasetResultRank
Molecular GenerationZINC 250K
FCD3.229
45
Molecular GenerationQM9
Validity99.85
26
Unconditional Molecule GenerationGuacaMol SMILES (test)
Validity89.7
10
Unconditional molecular generationZINC 250K
Validity96.1
9
Unconditional Molecule GenerationQM9 implicit hydrogens (80/10/10)
Validity0.999
8
Star Graph PlanningStar Graph Planning
Sequence Accuracy30.1
6
Maze PlanningImperfect Maze Planning Easy
Accuracy79.8
6
Maze PlanningPerfect Maze Planning Easy
Accuracy13.8
6
Maze PlanningPerfect Maze Planning Medium
Accuracy32
6
Maze PlanningPerfect Maze Planning Hard
Accuracy25.2
6
Showing 10 of 15 rows

Other info

Follow for update