
h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform

About

We introduce a theoretical framework for diffusion-based image editing by formulating it as a reverse-time bridge modeling problem. This approach modifies the backward process of a pretrained diffusion model to construct a bridge that converges to an implicit distribution associated with the editing target at time 0. Building on this framework, we propose h-Edit, a novel editing method that utilizes Doob's h-transform and Langevin Monte Carlo to decompose the update of an intermediate edited sample into two components: a "reconstruction" term and an "editing" term. This decomposition provides flexibility, allowing the reconstruction term to be computed via existing inversion techniques and enabling the combination of multiple editing terms to handle complex editing tasks. To our knowledge, h-Edit is the first training-free method capable of performing simultaneous text-guided and reward-model-based editing. Extensive experiments, both quantitative and qualitative, show that h-Edit outperforms state-of-the-art baselines in terms of editing effectiveness and faithfulness. Our source code is available at https://github.com/nktoan/h-edit.
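The decomposition described above — an update built from a "reconstruction" term plus one or more "editing" terms — can be sketched as follows. This is a minimal illustrative sketch only, not the authors' implementation: the function name `h_edit_step`, its signature, and the toy reconstruction/editing callables are all hypothetical placeholders for the real inversion-based and guidance-based terms.

```python
import numpy as np

def h_edit_step(x_t, reconstruction_term, editing_terms, step_size=0.1):
    """One illustrative h-Edit-style update (hypothetical API):
    move the intermediate sample by a reconstruction component
    (e.g. supplied by an existing inversion technique), then apply
    a sum of editing components (e.g. text guidance, reward gradients).
    Multiple editing terms can be combined for complex tasks."""
    x_next = x_t + reconstruction_term(x_t)
    for edit in editing_terms:  # editing terms are composable
        x_next = x_next + step_size * edit(x_next)
    return x_next

# Toy usage: the "reconstruction" pulls toward a reference sample,
# while two placeholder "edits" push along illustrative directions.
x_ref = np.zeros(4)
recon = lambda x: 0.5 * (x_ref - x)           # move halfway to reference
edit_a = lambda x: np.array([1.0, 0, 0, 0])   # stand-in text-guidance term
edit_b = lambda x: np.array([0, 1.0, 0, 0])   # stand-in reward-model term

x0 = np.ones(4)
x1 = h_edit_step(x0, recon, [edit_a, edit_b])
```

The point of the sketch is the structure, not the arithmetic: because the reconstruction and editing terms are separate callables, the reconstruction can come from any inversion method and several editing signals can be stacked, which is what enables the simultaneous text-guided and reward-based editing the paper describes.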

Toan Nguyen, Kien Do, Duc Kieu, Thin Nguyen • 2025

Related benchmarks

Task           Dataset                Result                        Rank
Image Editing  PIE-Bench (test)       PSNR 17.6958                  46
Image Editing  ImageNetR-Fake (test)  Structural Distance 44.7574   6
