AlphaFlow: Understanding and Improving MeanFlow Models

About

MeanFlow has recently emerged as a powerful framework for few-step generative modeling trained from scratch, but its success is not yet fully understood. In this work, we show that the MeanFlow objective naturally decomposes into two parts: trajectory flow matching and trajectory consistency. Through gradient analysis, we find that these terms are strongly negatively correlated, causing optimization conflict and slow convergence. Motivated by these insights, we introduce $\alpha$-Flow, a broad family of objectives that unifies trajectory flow matching, Shortcut Model, and MeanFlow under one formulation. By adopting a curriculum strategy that smoothly anneals from trajectory flow matching to MeanFlow, $\alpha$-Flow disentangles the conflicting objectives, and achieves better convergence. When trained from scratch on class-conditional ImageNet-1K 256x256 with vanilla DiT backbones, $\alpha$-Flow consistently outperforms MeanFlow across scales and settings. Our largest $\alpha$-Flow-XL/2+ model achieves new state-of-the-art results using vanilla DiT backbones, with FID scores of 2.58 (1-NFE) and 2.15 (2-NFE).

Huijie Zhang, Aliaksandr Siarohin, Willi Menapace, Michael Vasilkovsky, Sergey Tulyakov, Qing Qu, Ivan Skorokhodov• 2025

Related benchmarks

Task	Dataset	Result
Class-conditional Image Generation	ImageNet 256x256	--	967
Image Generation	ImageNet 256x256	--	517
Class-conditional Image Generation	ImageNet 256x256 (val)	--	493
Class-conditional generation	ImageNet 256 x 256 1k (val)	FID2.58	104
Image Generation	ImageNet 256x256 (test)	FID2.81	83
Class-conditional Image Generation	ImageNet-1k (val)	FID2.58	79
Conditional Image Generation	ImageNet 256px 2012 (val)	FID2.16	50
Class-conditional Image Generation	ImageNet 1k (train)	FID2.58	31

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord