Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule

About

We consider time discretization for score-based diffusion models to generate samples from a learned reverse-time dynamic on a finite grid. Uniform and hand-crafted grids can be suboptimal given a budget on the number of time steps. We introduce Adaptive Reparameterized Time (ART) that controls the clock speed of a reparameterized time variable, leading to a time change and uneven timesteps along the sampling trajectory while preserving the terminal time. The objective is to minimize the aggregate error arising from the discretized Euler scheme. We derive a randomized control companion, ART-RL, and formulate time change as a continuous-time reinforcement learning (RL) problem with Gaussian policies. We then prove that solving ART-RL recovers the optimal ART schedule, which in turn enables practical actor--critic updates to learn the latter in a data-driven way. Empirically, based on the official EDM pipeline, ART-RL improves Fr\'echet Inception Distance on CIFAR-10 over a wide range of budgets and transfers to AFHQv2, FFHQ, and ImageNet without the need of retraining.

Yilie Huang, Wenpin Tang, Xunyu Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Image GenerationCIFAR-10 (test)
FID1.85
471
Image GenerationFFHQ (test)
FID2.67
21
Image GenerationAFHQ v2 (test)
FID2.1
10
Image SynthesisImageNet
FID2.57
10
Showing 4 of 4 rows

Other info

Follow for update