Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver
About
Diffusion models have been firmly established as principled zero-shot solvers for linear and nonlinear inverse problems, owing to their powerful image prior and iterative sampling algorithm. These approaches often rely on Tweedie's formula, which relates the diffusion variate $\mathbf{x}_t$ to the posterior mean $\mathbb{E} [\mathbf{x}_0 | \mathbf{x}_t]$, in order to guide the diffusion trajectory with an estimate of the final denoised sample $\mathbf{x}_0$. However, this does not consider information from the measurement $\mathbf{y}$, which must then be integrated downstream. In this work, we propose to estimate the conditional posterior mean $\mathbb{E} [\mathbf{x}_0 | \mathbf{x}_t, \mathbf{y}]$, which can be formulated as the solution to a lightweight, single-parameter maximum likelihood estimation problem. The resulting prediction can be integrated into any standard sampler, resulting in a fast and memory-efficient inverse solver. Our optimizer is amenable to a noise-aware likelihood-based stopping criteria that is robust to measurement noise in $\mathbf{y}$. We demonstrate comparable or improved performance against a wide selection of contemporary inverse solvers across multiple datasets and tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Gaussian Deblurring | FFHQ 256x256-1K | FID22.62 | 37 | |
| Box Inpainting | FFHQ 256x256-1K | FID19.25 | 36 | |
| Motion Deblurring | FFHQ 256x256-1K | FID26.67 | 34 | |
| Random Inpainting | FFHQ 256x256-1K | FID21.19 | 32 | |
| Box Inpainting | ImageNet-1K | LPIPS0.23 | 21 | |
| Gaussian Deblurring | ImageNet-1K | LPIPS0.253 | 21 | |
| Motion Deblurring | ImageNet-1K | LPIPS0.203 | 21 | |
| Super-Resolution (SR x4) | ImageNet-1K | LPIPS0.238 | 21 | |
| Random Inpainting | ImageNet-1K | LPIPS0.142 | 21 | |
| Super-Resolution (SR x4) | FFHQ 256x256-1K | LPIPS0.137 | 20 |