
DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing

About

Diffusion models have achieved remarkable image generation quality surpassing previous generative models. However, a notable limitation of diffusion models, in comparison to GANs, is their difficulty in smoothly interpolating between two image samples, due to their highly unstructured latent space. Such a smooth interpolation is intriguing as it naturally serves as a solution for the image morphing task with many applications. In this work, we present DiffMorpher, the first approach enabling smooth and natural image interpolation using diffusion models. Our key idea is to capture the semantics of the two images by fitting two LoRAs to them respectively, and interpolate between both the LoRA parameters and the latent noises to ensure a smooth semantic transition, where correspondence automatically emerges without the need for annotation. In addition, we propose an attention interpolation and injection technique and a new sampling schedule to further enhance the smoothness between consecutive images. Extensive experiments demonstrate that DiffMorpher achieves starkly better image morphing effects than previous methods across a variety of object categories, bridging a critical functional gap that distinguished diffusion models from GANs.
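The abstract describes interpolating both the LoRA parameters and the latent noises but does not give formulas. The sketch below is one plausible reading, not the authors' implementation: it assumes linear interpolation for the LoRA parameters and spherical interpolation (slerp, a common choice for Gaussian noise) for the latents. The names `lerp`, `slerp`, and `morph_inputs` are hypothetical, and plain Python lists stand in for flattened, non-zero tensors.

```python
import math


def lerp(a, b, alpha):
    """Linear interpolation between two equal-length vectors."""
    return [(1.0 - alpha) * x + alpha * y for x, y in zip(a, b)]


def slerp(a, b, alpha, eps=1e-7):
    """Spherical interpolation between two non-zero vectors.

    Falls back to lerp when the vectors are (anti-)parallel, where the
    slerp weights become numerically unstable.
    """
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    cos_theta = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    cos_theta = max(-1.0, min(1.0, cos_theta))  # guard against rounding
    theta = math.acos(cos_theta)
    if math.sin(theta) < eps:
        return lerp(a, b, alpha)
    w_a = math.sin((1.0 - alpha) * theta) / math.sin(theta)
    w_b = math.sin(alpha * theta) / math.sin(theta)
    return [w_a * x + w_b * y for x, y in zip(a, b)]


def morph_inputs(lora_a, lora_b, noise_a, noise_b, num_frames):
    """Return (alpha, lora_params, latent_noise) for each morph frame.

    Assumption: LoRA parameters are blended linearly and latent noise
    spherically; each frame would then be denoised by the diffusion model.
    """
    if num_frames < 2:
        raise ValueError("num_frames must be at least 2")
    frames = []
    for i in range(num_frames):
        alpha = i / (num_frames - 1)
        frames.append(
            (alpha, lerp(lora_a, lora_b, alpha), slerp(noise_a, noise_b, alpha))
        )
    return frames
```

With `num_frames=5`, for example, the first frame uses the first image's LoRA and noise unchanged, the last frame uses the second image's, and the three frames in between use blended values. The attention interpolation and the new sampling schedule mentioned in the abstract are not covered by this sketch.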

Kaiwen Zhang, Yifan Zhou, Xudong Xu, Xingang Pan, Bo Dai • 2023

Related benchmarks

Task | Dataset | Metric | Result | Rank
Image Blending | Totally Looks Like Low Difficulty | Attribute-Masked DreamSim (Input 1) | 0.547 | 8
Image Blending | Totally Looks Like High Difficulty | AMD Score (Input 1) | 0.39 | 8
3D Morphing | 50 source-target pairs (Objaverse, GSO, Trellis) | FID | 208.1 | 7
Textured 3D Morphing | Textured 3D Morphing (test) | FID | 218.1 | 7
Textured 3D Morphing | Interp3DData Easy | FID | 160.9 | 6
Textured 3D Morphing | Interp3DData Mid | FID | 177.4 | 6
Textured 3D Morphing | Interp3DData Hard | FID | 170.3 | 6
Textured 3D Morphing | Interp3DData (Average) | FID | 169.5 | 6
Generative Video Transitions | Video Transitions Evaluation Set | FID | 151 | 5
3D Morphing | TexMorph | MSE(SSIM) | 0.19 | 5
Showing 10 of 12 rows
