Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

NEAT: Neighborhood-Guided, Efficient, Autoregressive Set Transformer for 3D Molecular Generation

About

Transformer-based autoregressive models offer an efficient alternative to diffusion- and flow-matching-based approaches for generating 3D molecules. One challenge remains: standard transformer architectures require a sequential ordering of tokens, which is not inherently defined for the atoms in a molecule. Prior works have addressed this by using canonical atom orderings. However, these approaches are not permutation invariant w.r.t. atoms and bias next-token prediction towards ordering conventions. We overcome this limitation by introducing a novel neighborhood-guided training strategy. Our model, NEAT (Neighborhood-Guided, Efficient, Autoregressive Set Transformer) treats molecular graphs as sets of atoms and learns an order-agnostic distribution over admissible tokens at the graph boundary, thereby ensuring atom-level permutation invariance. NEAT achieves state-of-the-art generation quality on the QM9 and GEOM-Drugs datasets while offering a significant speed advantage over existing baselines.

Daniel Rose, Roxane Axel Jacob, Johannes Kirchmair, Thierry Langer• 2025

Related benchmarks

TaskDatasetResultRank
3D Molecule GenerationQM9 (test)
Validity94.4
64
Unconditional 3D Molecular GenerationGEOM (test)
Atom Stability81.1
13
3D Molecular Prefix CompletionGEOM
Atom Stable0.811
2
Showing 3 of 3 rows

Other info

Follow for update