Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

B\'ezierFlow: Learning B\'ezier Stochastic Interpolant Schedulers for Few-Step Generation

About

We introduce B\'ezierFlow, a lightweight training approach for few-step generation with pretrained diffusion and flow models. B\'ezierFlow achieves a 2-3x performance improvement for sampling with $\leq$ 10 NFEs while requiring only 15 minutes of training. Recent lightweight training approaches have shown promise by learning optimal timesteps, but their scope remains restricted to ODE discretizations. To broaden this scope, we propose learning the optimal transformation of the sampling trajectory by parameterizing stochastic interpolant (SI) schedulers. The main challenge lies in designing a parameterization that satisfies critical desiderata, including boundary conditions, differentiability, and monotonicity of the SNR. To effectively meet these requirements, we represent scheduler functions as B\'ezier functions, where control points naturally enforce these properties. This reduces the problem to learning an ordered set of points in the time range, while the interpretation of the points changes from ODE timesteps to B\'ezier control points. Across a range of pretrained diffusion and flow models, B\'ezierFlow consistently outperforms prior timestep-learning methods, demonstrating the effectiveness of expanding the search space from discrete timesteps to B\'ezier-based trajectory transformations.

Yunhong Min, Juil Koo, Seungwoo Yoo, Minhyuk Sung• 2025

Related benchmarks

TaskDatasetResultRank
Image GenerationFFHQ 64x64 (test)
FID33.72
69
Unconditional Layout GenerationRico
FID2.96
55
Image GenerationCIFAR-10 32x32 with ReFlow (test)
FID3.74
48
Image GenerationImageNet 256x256 with FlowDCN (val)
FID5.94
48
Image GenerationMS-COCO 512x512 with Stable Diffusion (val)
FID11.02
48
Text-to-Image GenerationMS-COCO zero-shot 512 x 512 Stable Diffusion v3.5
CLIP Score0.263
48
Image GenerationCIFAR-10 32x32
FID2.09
44
Unconditional 3D Point Cloud GenerationShapeNet airplane (val)
MMD0.53
40
Image GenerationCIFAR-10 32x32 EDM (test)
FID22.2
24
Image GenerationAFHQ 64x64 v2 EDM (test)
FID26.31
24
Showing 10 of 13 rows

Other info

Follow for update