Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Solving Inverse Problems with FLAIR

About

Flow-based latent generative models such as Stable Diffusion 3 are able to generate images with remarkable quality, even enabling photorealistic text-to-image generation. Their impressive performance suggests that these models should also constitute powerful priors for inverse imaging problems, but that approach has not yet led to comparable fidelity. There are several key obstacles: (i) the data likelihood term is usually intractable; (ii) learned generative models cannot be directly conditioned on the distorted observations, leading to conflicting objectives between data likelihood and prior; and (iii) the reconstructions can deviate from the observed data. We present FLAIR, a novel, training-free variational framework that leverages flow-based generative models as prior for inverse problems. To that end, we introduce a variational objective for flow matching that is agnostic to the type of degradation, and combine it with deterministic trajectory adjustments to guide the prior towards regions which are more likely under the posterior. To enforce exact consistency with the observed data, we decouple the optimization of the data fidelity and regularization terms. Moreover, we introduce a time-dependent calibration scheme in which the strength of the regularization is modulated according to off-line accuracy estimates. Results on standard imaging benchmarks demonstrate that FLAIR consistently outperforms existing diffusion- and flow-based methods in terms of reconstruction quality and sample diversity. Our code is available at https://inverseflair.github.io/.

Julius Erbach, Dominik Narnhofer, Andreas Dombos, Bernt Schiele, Jan Eric Lenssen, Konrad Schindler• 2025

Related benchmarks

TaskDatasetResultRank
InpaintingFFHQ 1k
PSNR21.84
14
InpaintingDIV2K 0.8k
PSNR23.9
14
Video EditingVPBench (test)
CLIP Score26.3
13
Image EditingInpaintCOCO 512px
FID41.5
12
Image EditingHumanEdit 1024px
FID30.7
12
Motion DeblurringFFHQ 1k
PSNR28.5
7
Motion DeblurringDIV2K 0.8k
PSNR22.95
7
Super-ResolutionFFHQ 1k
PSNR26.82
7
Gaussian DeblurringFFHQ 1k
PSNR26.84
7
Gaussian DeblurringDIV2K 0.8k
PSNR21.27
7
Showing 10 of 11 rows

Other info

Follow for update