Dynamic Search for Inference-Time Alignment in Diffusion Models

About

Diffusion models have shown promising generative capabilities across diverse domains, yet aligning their outputs with desired reward functions remains a challenge, particularly when the reward functions are non-differentiable. Some gradient-free guidance methods have been developed, but they often struggle to achieve optimal inference-time alignment. In this work, we frame inference-time alignment in diffusion models as a search problem and propose Dynamic Search for Diffusion (DSearch), which subsamples from denoising processes and approximates intermediate node rewards. DSearch also dynamically adjusts beam width and tree expansion to efficiently explore high-reward generations. To refine intermediate decisions, DSearch incorporates adaptive scheduling based on noise levels and a lookahead heuristic function. We validate DSearch across multiple domains, including biological sequence design, molecular optimization, and image generation, demonstrating superior reward optimization compared to existing approaches.
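
As a rough illustration of the search framing described above, the following Python sketch runs a beam search over a toy one-dimensional denoising process. It is not the authors' implementation: the sampler `denoise_step`, the reward `reward`, the lookahead `lookahead`, and the linear width schedule `beam_width_at` are all hypothetical stand-ins chosen for illustration, and the constants are arbitrary.

```python
import random


def reward(x):
    """Piecewise-constant (non-differentiable) toy reward: best when x rounds to 2."""
    return -abs(round(x) - 2)


def denoise_step(x, t, steps, rng):
    """Toy reverse step: contract x and add noise that shrinks as t approaches 0."""
    return 0.9 * x + 0.5 * (t / steps) * rng.gauss(0.0, 1.0)


def lookahead(x, t):
    """Estimate the final sample from an intermediate state via a noise-free rollout."""
    return x * 0.9 ** t


def beam_width_at(t, steps, max_width):
    """Hypothetical schedule: wide beam at high noise, narrowing to 1 near the end."""
    return max(1, max_width * t // steps)


def dsearch_sketch(steps=10, max_width=8, expand=4, seed=0):
    rng = random.Random(seed)
    beam = [rng.gauss(0.0, 3.0) for _ in range(max_width)]  # initial noise samples
    for t in range(steps, 0, -1):
        # Expand every beam node into several stochastic children.
        children = [denoise_step(x, t, steps, rng) for x in beam for _ in range(expand)]
        # Score each child by the reward of its lookahead estimate (t - 1 steps remain).
        children.sort(key=lambda c: reward(lookahead(c, t - 1)), reverse=True)
        # Keep only the top candidates allowed by the current beam width.
        beam = children[:beam_width_at(t, steps, max_width)]
    return max(beam, key=reward)


best = dsearch_sketch()
print(best, reward(best))
```

Because the loop only ranks candidates by reward values and never takes gradients, `reward` can be replaced by any black-box scorer without changing the search itself.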

Xiner Li, Masatoshi Uehara, Xingyu Su, Gabriele Scalia, Tommaso Biancalani, Aviv Regev, Sergey Levine, Shuiwang Ji • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Semantic Attribute Alignment | Gemma animal-attribute prompts | Happy Score: 0.29 | 9 |
| Aesthetic Reward Optimization | Animal prompts 2D image generation | Aesthetic Score: 6.61 | 5 |
| HPSv3 Reward Optimization | Animal prompts 2D image generation | HPSv3 Score: 9.9 | 5 |
| 3D Aerodynamic optimization | 3D Vehicle models | Aerodynamic Score: 0.21 | 5 |
| Compressibility optimization | Animal prompts 2D image generation | Compressibility: 54.85 | 5 |
| Incompressibility optimization | Animal prompts 2D image generation | Incompressibility Score: 120.1 | 5 |