DBMSolver: A Training-free Diffusion Bridge Sampler for High-Quality Image-to-Image Translation
About
Diffusion-based image-to-image (I2I) translation excels in high-fidelity generation but suffers from slow sampling in state-of-the-art Diffusion Bridge Models (DBMs), often requiring dozens of function evaluations (NFEs). We introduce DBMSolver, a training-free sampler that exploits the semi-linear structure of DBM's underlying SDE and ODE via exponential integrators, yielding highly-efficient 1st- and 2nd-order solutions. This reduces NFEs by up to 5x while boosting quality (e.g., FID drops 53% on DIODE at 20 NFEs vs. 2nd-order baseline). Experiments on inpainting, stylization, and semantics-to-image tasks across resolutions up to 256x256 show DBMSolver sets new SOTA efficiency-quality tradeoffs, enabling real-world applicability. Our code is publicly available at https://github.com/snumprlab/dbmsolver.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image-to-Image Translation | Edges -> Handbags 64 x 64 (test) | FID0.53 | 21 | |
| Class-Conditional Inpainting | ImageNet center 128x128 mask 256 x 256 | FID4.07 | 11 | |
| Surface Normals-to-Image Translation | DIODE 256 x 256 (test) | FID2.06 | 10 |