G2D2: Gradient-Guided Discrete Diffusion for Inverse Problem Solving
About
Recent literature has effectively leveraged diffusion models trained on continuous variables as priors for solving inverse problems. Notably, discrete diffusion models with discrete latent codes have shown strong performance, particularly in modalities suited for discrete compressed representations, such as image and motion generation. However, their discrete and non-differentiable nature has limited their application to inverse problems formulated in continuous spaces. This paper presents a novel method for addressing linear inverse problems by leveraging generative models based on discrete diffusion as priors. We overcome these limitations by approximating the true posterior distribution with a variational distribution constructed from categorical distributions and continuous relaxation techniques. Furthermore, we employ a star-shaped noise process to mitigate the drawbacks of traditional discrete diffusion models with absorbing states, demonstrating that our method performs comparably to continuous diffusion techniques with a lower GPU memory consumption. Our code is available at https://github.com/sony/g2d2.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Super-Resolution (4x) | FFHQ | PSNR27.29 | 42 | |
| Deblur | FFHQ | PSNR26.91 | 10 | |
| AND inverse problem | MNIST Medium | Token Accuracy95.37 | 5 | |
| AND inverse problem | MNIST (Hard) | Token Accuracy92.34 | 5 | |
| XOR inverse problem | MNIST (Easy) | Token Accuracy98.3 | 5 | |
| AND inverse problem | MNIST (Easy) | Token Accuracy98.06 | 5 | |
| Box Inpainting | MNIST (Easy) | Token Accuracy95.94 | 5 | |
| Inpainting | MNIST (Easy) | Token Accuracy97.75 | 5 | |
| Inpainting | MNIST Medium | Token Accuracy0.9427 | 5 | |
| Inpainting | MNIST (Hard) | Token Accuracy90.74 | 5 |