Generative Modeling via Drifting
About
Generative modeling can be formulated as learning a mapping f such that its pushforward distribution matches the data distribution. The pushforward behavior can be carried out iteratively at inference time, for example in diffusion and flow-based models. In this paper, we propose a new paradigm called Drifting Models, which evolve the pushforward distribution during training and naturally admit one-step inference. We introduce a drifting field that governs the sample movement and achieves equilibrium when the distributions match. This leads to a training objective that allows the neural network optimizer to evolve the distribution. In experiments, our one-step generator achieves state-of-the-art results on ImageNet at 256 x 256 resolution, with an FID of 1.54 in latent space and 1.61 in pixel space. We hope that our work opens up new opportunities for high-quality one-step generation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Class-conditional Image Generation | ImageNet 256x256 | Inception Score (IS)263.2 | 967 | |
| Image Generation | ImageNet 256x256 | IS258.9 | 517 | |
| Image Generation | ImageNet 256x256 (val) | FID1.43 | 399 | |
| Face Generation | FFHQ | EMD129.3 | 42 | |
| 3D pointcloud manipulation | MetaWorld | Success Rate (Easy)92.7 | 30 | |
| Robotic Manipulation | DexArt | Success Rate (Bucket)29 | 29 | |
| Robot Manipulation | MetaWorld, Adroit, and Dexart Combined | Average Success Rate79.8 | 25 | |
| Dexterous Manipulation | Adroit | Hammer Success100 | 17 | |
| CT Image Denoising | Mayo quarter-dose (test) | PSNR45.94 | 13 | |
| Molecule Generation | QM9 | Validity22 | 10 |