Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

D-Flow: Differentiating through Flows for Controlled Generation

About

Taming the generation outcome of state of the art Diffusion and Flow-Matching (FM) models without having to re-train a task-specific model unlocks a powerful tool for solving inverse problems, conditional generation, and controlled generation in general. In this work we introduce D-Flow, a simple framework for controlling the generation process by differentiating through the flow, optimizing for the source (noise) point. We motivate this framework by our key observation stating that for Diffusion/FM models trained with Gaussian probability paths, differentiating through the generation process projects gradient on the data manifold, implicitly injecting the prior into the optimization process. We validate our framework on linear and non-linear controlled generation problems including: image and audio inverse problems and conditional molecule generation reaching state of the art performance across all.

Heli Ben-Hamu, Omri Puny, Itai Gat, Brian Karrer, Uriel Singer, Yaron Lipman• 2024

Related benchmarks

TaskDatasetResultRank
Super-ResolutionCelebA--
24
DeblurringCelebA
PSNR31.07
11
Random InpaintingCelebA 100 images (test)
PSNR33.67
10
DeblurringAFHQ-Cat 100 images (test)
PSNR27.82
10
Random InpaintingAFHQ-Cat 100 images (test)
PSNR32.2
10
Super-ResolutionAFHQ-Cat 100 images (test)
PSNR24.64
10
Super-ResolutionCelebA 100 images (test)
PSNR30.47
10
DeblurringCelebA 100 images (test)
PSNR31.25
10
DenoisingAFHQ-Cat 100 images (test)
PSNR26.13
10
DenoisingCelebA 100 images (test)
PSNR26.04
10
Showing 10 of 42 rows

Other info

Follow for update