Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models
About
Training-free guidance enables controlled generation in diffusion and flow models, but most methods rely on gradients and assume differentiable objectives. This work focuses on training-free guidance addressing challenges from non-differentiable objectives and discrete data distributions. We propose TreeG: Tree Search-Based Path Steering Guidance, applicable to both continuous and discrete settings in diffusion and flow models. TreeG offers a unified framework for training-free guidance by proposing, evaluating, and selecting candidates at each step, enhanced with tree search over active paths and parallel exploration. We comprehensively investigate the design space of TreeG over the candidate proposal module and the evaluation function, instantiating TreeG into three novel algorithms. Our experiments show that TreeG consistently outperforms top guidance baselines in symbolic music generation, small molecule design, and enhancer DNA design with improvements of 29.01%, 16.6%, and 18.43%. Additionally, we identify an inference-time scaling law showing TreeG's scalability in inference-time computation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Text-to-Image Generation | ImageReward (test) | ImageReward Score1.023 | 16 | |
| Semantic Attribute Alignment | Gemma animal-attribute prompts | Happy Score0.08 | 9 | |
| 3D Aerodynamic optimization | 3D Vehicle models | Aerodynamic Score0.24 | 5 | |
| Aesthetic Reward Optimization | Animal prompts 2D image generation | Aesthetic Score6 | 5 | |
| HPSv3 Reward Optimization | Animal prompts 2D image generation | HPSv3 Score8.4 | 5 | |
| Compressibility optimization | Animal prompts 2D image generation | Compressibility79.32 | 5 | |
| Incompressibility optimization | Animal prompts 2D image generation | Incompressibility Score110.8 | 5 |