The Velocity Deficit: Initial Energy Injection for Flow Matching

About

While Flow Matching theoretically guarantees constant-velocity trajectories, we identify a critical breakdown in high-dimensional practice: the Velocity Deficit. We show that the MSE objective systematically underestimates velocity magnitude, causing generated samples to fail to reach the data manifold-a phenomenon we term Integration Lag. To rectify this, we propose Initial Energy Injection, instantiated via two complementary methods: the training-based Magnitude-Aware Flow Matching (MAFM) and the training-free Scale Schedule Corrector (SSC). Both are grounded in our discovery of a crucial asymmetry: velocity contraction causes harmful kinetic stagnation at the trajectory's start, yet acts as a beneficial denoising mechanism at its end. Empirically, SSC yields significant efficiency gains with zero retraining and just one line of code. On ImageNet-1k (256x256), it improves FID by 44.6% (from 13.68 to 7.58) and achieves a 5x speedup, enabling a 50-step generator (FID 7.58) to beat a 250-step baseline (FID 8.65). Furthermore, our methods generalize to Text-to-Image tasks and high-resolution generation, improving FID on MS-COCO by ~22%.

Linze Li, Zong-Wei Hong, Shen Zhang, Bo Lin, Jinglun Li, Yao Tang, Jiajun Liang• 2026

Related benchmarks

Task	Dataset	Result
Text-to-Image Generation	MS-COCO	FID4.71	193
Class-conditional generation	ImageNet 256 x 256 1k (val)	FID7.69	130
Class-conditional Image Generation	ImageNet-1K 256x256	FID14.26	26
Class-conditional Image Generation	ImageNet-1k 512x512	FID8.39	11

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord