Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Can Microcanonical Langevin Dynamics Leverage Mini-Batch Gradient Noise?

About

Scaling inference methods such as Markov chain Monte Carlo to high-dimensional models remains a central challenge in Bayesian deep learning. A promising recent proposal, microcanonical Langevin Monte Carlo, has shown state-of-the-art performance across a wide range of problems. However, its reliance on full-dataset gradients makes it prohibitively expensive for large-scale problems. This paper addresses a fundamental question: Can microcanonical dynamics effectively leverage mini-batch gradient noise? We provide the first systematic study of this problem, establishing a novel continuous-time theoretical analysis of stochastic-gradient microcanonical dynamics. We reveal two critical failure modes: a theoretically derived bias due to anisotropic gradient noise and numerical instabilities in complex high-dimensional posteriors. To tackle these issues, we propose a principled gradient noise preconditioning scheme shown to significantly reduce this bias and develop a novel, energy-variance-based adaptive tuner that automates step size selection and dynamically informs numerical guardrails. The resulting algorithm is a robust and scalable microcanonical Monte Carlo sampler that achieves state-of-the-art performance on challenging high-dimensional inference tasks like Bayesian neural networks. Combined with recent ensemble techniques, our work unlocks a new class of stochastic microcanonical Langevin ensemble (SMILE) samplers for large-scale Bayesian inference.

Emanuel Sommer, Kangning Diao, Jakob Robnik, Uros Seljak, David R\"ugamer• 2026

Related benchmarks

TaskDatasetResultRank
Character-level Language ModelingShakespeare modern
Accuracy55.62
48
RegressionBikesharing (3 train-test splits)
LPPD0.132
7
RegressionEnergy (3 train-test splits)
LPPD2.018
7
RegressionAirfoil (3 train-test splits)
LPPD0.28
7
Image ClassificationCIFAR-10
LPPD-0.2763
5
Image ClassificationCIFAR-10
LPPD-0.2763
5
Image ClassificationImagenette
LPPD-0.7708
4
Showing 7 of 7 rows

Other info

Follow for update