Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scalable Equilibrium Sampling with Sequential Boltzmann Generators

About

Scalable sampling of molecular states in thermodynamic equilibrium is a long-standing challenge in statistical physics. Boltzmann generators tackle this problem by pairing normalizing flows with importance sampling to obtain uncorrelated samples under the target distribution. In this paper, we extend the Boltzmann generator framework with two key contributions, denoting our framework Sequential Boltzmann generators (SBG). The first is a highly efficient Transformer-based normalizing flow operating directly on all-atom Cartesian coordinates. In contrast to the equivariant continuous flows of prior methods, we leverage exactly invertible non-equivariant architectures which are highly efficient during both sample generation and likelihood evaluation. This efficiency unlocks more sophisticated inference strategies beyond standard importance sampling. In particular, we perform inference-time scaling of flow samples using a continuous-time variant of sequential Monte Carlo, in which flow samples are transported towards the target distribution with annealed Langevin dynamics. SBG achieves state-of-the-art performance w.r.t. all metrics on peptide systems, demonstrating the first equilibrium sampling in Cartesian coordinates of tri-, tetra- and hexa-peptides that were thus far intractable for prior Boltzmann generators.

Charlie B. Tan, Avishek Joey Bose, Chen Lin, Leon Klein, Michael M. Bronstein, Alexander Tong• 2025

Related benchmarks

TaskDatasetResultRank
Molecular SamplingTri-alanine (AL3) 10^4 samples (test)
E-W20.598
12
Molecular SamplingAlanine tetrapeptide (AL4) 10^4 samples (test)
E-W21.007
6
Molecular SamplingHexa-alanine (AL6) 10^4 samples (test)
E-W21.021
5
Molecular distribution modelingHexa-alanine
Cumulative Time57.5
4
Molecular distribution modelingAlanine Dipeptide
Total Time12.52
4
Molecular distribution modelingTri-alanine
Cumulative Training + Inference Time19.59
4
Molecular distribution modelingAlanine tetrapeptide
Cumulative Train + Inference Time32.17
4
Showing 7 of 7 rows

Other info

Follow for update