How to Guide Your Flow: Few-Step Alignment via Flow Map Reward Guidance
About
In generative modeling, we often wish to produce samples that maximize a user-specified reward such as aesthetic quality or alignment with human preferences, a problem known as \textit{guidance}. Despite their widespread use, existing guidance methods either require expensive multi-particle, many-step schemes or rely on poorly understood approximations. We reformulate guidance as a \textit{deterministic optimal control problem}, yielding a hierarchy of algorithms that subsumes existing approaches at the coarsest level. We show that the \textit{flow map}, an object of significant recent interest for its role in fast inference, arises naturally in the optimal solution. Based on this observation, we propose \textbf{Flow Map Reward Guidance (FMRG)}: a training-free, \textit{single-trajectory} framework that uses the flow map to both integrate and guide the flow. At text-to-image scale, FMRG matches or surpasses baselines across inverse problems and reward-guided generation with \textbf{as few as 3 NFEs}, giving at least an order-of-magnitude speedup in comparison to prior state of the art.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Text-to-Image Generation | GenEval | Overall Score80 | 704 | |
| Inpainting | FFHQ | LPIPS0.103 | 62 | |
| Motion Deblur | FFHQ | PSNR28.62 | 56 | |
| Super-Resolution | AFHQ | PSNR27.39 | 9 | |
| Motion Deblur | AFHQ | PSNR27.36 | 5 | |
| Style Transfer | Style Transfer Gram matrix loss | CLIP-I Score64.9 | 5 |