TFG: Unified Training-Free Guidance for Diffusion Models
About
Given an unconditional diffusion model and a predictor for a target property of interest (e.g., a classifier), the goal of training-free guidance is to generate samples with desirable target properties without additional training. Existing methods, though effective in various individual applications, often lack theoretical grounding and rigorous testing on extensive benchmarks. As a result, they could even fail on simple tasks, and applying them to a new problem becomes unavoidably difficult. This paper introduces a novel algorithmic framework encompassing existing methods as special cases, unifying the study of training-free guidance into the analysis of an algorithm-agnostic design space. Via theoretical and empirical investigation, we propose an efficient and effective hyper-parameter searching strategy that can be readily applied to any downstream task. We systematically benchmark across 7 diffusion models on 16 tasks with 40 targets, and improve performance by 8.5% on average. Our framework and benchmark offer a solid foundation for conditional generation in a training-free manner.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Quantum Property Prediction | QM9 | Dipole Moment (mu)1.33 | 35 | |
| Nearby Molecular Sampling | ZINC250k and GEOM-DRUG (random 800 source molecules) | Success Rate94.37 | 22 | |
| Offline Reinforcement Learning | D4RL v2 (various) | Average Score82.1 | 17 | |
| Prize Collecting Traveling Salesperson Problem | PCTSP-20 | Optimality Gap2.54 | 15 | |
| Gaussian Deblur 3 | Cats | LPIPS0.11 | 14 | |
| Gaussian Deblurring | ImageNet Gaussian Blur sigma=3 | LPIPS0.16 | 14 | |
| Super-Resolution (4x) | Cats | LPIPS0.08 | 14 | |
| Super-Resolution | ImageNet 4x scale | LPIPS0.13 | 14 | |
| Inpainting | Cats | LPIPS0.09 | 14 | |
| Gaussian Deblur 12 | Cats | LPIPS0.29 | 14 |