Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Distribution Matching Distillation without Fake Score Network

About

Distribution Matching Distillation (DMD) provides an effective distribution-level correction for few-step generation, while relying on an auxiliary fake-score network to track the evolving generative distribution. Recent work combines DMD-style objectives with flow-map generators to exploit both forward-divergence training and reverse-divergence correction. The fake-score estimator remains an additional component with memory and update overhead. In this work, we study whether this explicit tracker can be avoided when the generator itself has a flow-map structure. We propose Fake-Score-network-Free DMD (FSF-DMD), a DMD formulation for flow-map generators that replaces the auxiliary fake-score estimator with a generator-induced pseudo-velocity surrogate. The key observation is that the endpoint pseudo-velocity of a flow-map generator provides a tractable proxy for fake-velocity estimation, allowing the generator itself to supply the reverse-divergence signal. Building on this observation, we derive a practical objective, extend it with flow-map-consistent backward simulation, and introduce a self-teacher variant for training from scratch. In our ImageNet-1K $256 \times 256$ experiments, FSF-DMD improves flow-map baselines, reaches lower FID than the listed DMD2 comparisons in the flow-map-initialized setting, and remains effective under flow-matching initialization and training from scratch.

Youngjoong Kim, Deokyeong Lee, Jaesik Park• 2026

Related benchmarks

TaskDatasetResultRank
Image GenerationImageNet-256 (FID-50K)
FID3.85
36
Image GenerationImageNet-1K 256x256
FID6.35
26
Image GenerationImageNet-1K 256
FID4.7
6
Showing 3 of 3 rows

Other info

Follow for update