Diffusion models as plug-and-play priors
About
We consider the problem of inferring high-dimensional data $\mathbf{x}$ in a model that consists of a prior $p(\mathbf{x})$ and an auxiliary differentiable constraint $c(\mathbf{x},\mathbf{y})$ on $\mathbf{x}$ given some additional information $\mathbf{y}$. In this paper, the prior is an independently trained denoising diffusion generative model. The auxiliary constraint is expected to have a differentiable form, but can come from diverse sources. The possibility of such inference turns diffusion models into plug-and-play modules, thereby allowing a range of potential applications in adapting models to new domains and tasks, such as conditional generation or image segmentation. The structure of diffusion models allows us to perform approximate inference by iterating differentiation through the fixed denoising network, whose input is perturbed with a different amount of noise at each step. Considering many noised versions of $\mathbf{x}$ when evaluating its fitness is a novel search mechanism that may lead to new algorithms for solving combinatorial optimization problems.
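The inference loop described above can be illustrated with a minimal one-dimensional sketch. It is not the paper's implementation. Everything specific in it is an assumption made for illustration: a Gaussian toy prior whose optimal noise predictor has a closed form stands in for the trained denoising network, the gradient is written by hand where an autodiff framework would normally be used, and the names `alpha_bar`, `eps_model`, `eps_model_slope`, `constraint_grad`, and `infer`, as well as the noise schedule and step size, are hypothetical.

```python
import math
import random

PRIOR_MEAN, PRIOR_STD = 2.0, 0.5  # toy prior p(x) = N(2, 0.5^2) (assumption)
T = 100                           # number of noise levels (assumption)


def alpha_bar(t):
    """Hypothetical schedule: signal fraction shrinks linearly for t = 1..T."""
    return 1.0 - t / (T + 1)


def scales(t):
    """Signal and noise scales (a, b) such that x_t = a * x + b * eps."""
    return math.sqrt(alpha_bar(t)), math.sqrt(1.0 - alpha_bar(t))


def eps_model(x_t, t):
    """Closed-form noise predictor for the Gaussian toy prior.

    Stands in for the fixed, independently trained denoising network.
    """
    a, b = scales(t)
    return b * (x_t - a * PRIOR_MEAN) / (a * a * PRIOR_STD ** 2 + b * b)


def eps_model_slope(t):
    """Derivative of eps_model w.r.t. x_t (autodiff would supply this)."""
    a, b = scales(t)
    return b / (a * a * PRIOR_STD ** 2 + b * b)


def constraint_grad(x, y, tau=1.0):
    """Gradient of -log c(x, y) for the toy c = exp(-(x - y)^2 / (2 tau^2))."""
    return (x - y) / tau ** 2


def infer(y, lr=0.05, n_noise=8, seed=0):
    """Gradient descent on denoising loss plus constraint, annealing the noise."""
    rng = random.Random(seed)
    x = rng.gauss(0.0, 1.0)               # random initial guess
    for t in range(T, 0, -1):             # from high noise to low noise
        a, b = scales(t)
        grad = constraint_grad(x, y)
        for _ in range(n_noise):          # average over several noised copies
            eps = rng.gauss(0.0, 1.0)
            x_t = a * x + b * eps         # noised version of the current x
            residual = eps - eps_model(x_t, t)
            # Chain rule for d/dx of (eps - eps_model(a * x + b * eps, t))^2.
            grad += -2.0 * residual * eps_model_slope(t) * a / n_noise
        x -= lr * grad
    return x


x_hat = infer(y=0.0)
print(f"estimate given y = 0.0: {x_hat:.3f}")
```

In this toy setting the constraint pulls the estimate toward $\mathbf{y}$ while the denoising term pulls it toward the prior mean, so the result is expected to settle between the two. With a real diffusion model, `eps_model` would be the pretrained network and the hand-written derivative would be replaced by automatic differentiation.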
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Traveling Salesman Problem | TSP50 | Optimality Gap | 1.23 | 58 |
| Traveling Salesman Problem | TSP-100 | Optimality Drop | 2.11 | 53 |
| Traveling Salesperson Problem | TSP-100 | Solution Length | 7.92 | 42 |
| Traveling Salesman Problem | Euclidean TSP N=50 | Optimal Tour Length | 5.76 | 26 |
| Traveling Salesman Problem | TSP-50 | Gap | 1.23 | 15 |
| Traveling Salesman Problem | Euclidean TSP N=100 | Objective Value | 7.92 | 10 |
| Semantic segmentation | EnviroAtlas (Durham, NC) | Accuracy (%) | 79.8 | 6 |
| Semantic segmentation | EnviroAtlas Austin, TX | Accuracy (%) | 79.5 | 6 |
| Semantic segmentation | EnviroAtlas Phoenix, AZ | Accuracy | 69.6 | 6 |