Mean Flows for One-step Generative Modeling
About
We propose a principled and effective framework for one-step generative modeling. We introduce the notion of average velocity to characterize flow fields, in contrast to instantaneous velocity modeled by Flow Matching methods. A well-defined identity between average and instantaneous velocities is derived and used to guide neural network training. Our method, termed the MeanFlow model, is self-contained and requires no pre-training, distillation, or curriculum learning. MeanFlow demonstrates strong empirical performance: it achieves an FID of 3.43 with a single function evaluation (1-NFE) on ImageNet 256x256 trained from scratch, significantly outperforming previous state-of-the-art one-step diffusion/flow models. Our study substantially narrows the gap between one-step diffusion/flow models and their multi-step predecessors, and we hope it will motivate future research to revisit the foundations of these powerful models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Class-conditional Image Generation | ImageNet 256x256 | -- | 815 | |
| Image Classification | ImageNet A | Top-1 Acc4.8 | 654 | |
| Image Classification | ImageNet V2 | Top-1 Acc44 | 611 | |
| Image Classification | ImageNet-R | Top-1 Acc4.5 | 529 | |
| Class-conditional Image Generation | ImageNet 256x256 (val) | FID2.2 | 427 | |
| Image Classification | ImageNet-Sketch | Top-1 Accuracy3.9 | 407 | |
| Image Generation | ImageNet 256x256 | IS247.5 | 359 | |
| Image Generation | ImageNet 256x256 (val) | FID3.43 | 340 | |
| Unconditional Image Generation | CIFAR-10 | FID2.88 | 240 | |
| Image Classification | ObjectNet | Top-1 Accuracy10.27 | 219 |