
SyncTweedies: A General Generative Framework Based on Synchronized Diffusions

About

We introduce a general framework for generating diverse visual content, including ambiguous images, panorama images, mesh textures, and Gaussian splat textures, by synchronizing multiple diffusion processes. We present an exhaustive investigation of all possible scenarios for synchronizing multiple diffusion processes through a canonical space and analyze their characteristics across applications. In doing so, we reveal a previously unexplored case: averaging the outputs of Tweedie's formula while conducting denoising in multiple instance spaces. This case also provides the best quality with the widest applicability to downstream tasks. We name this case SyncTweedies. In experiments generating the aforementioned visual content, we demonstrate that SyncTweedies achieves higher generation quality than other synchronization methods as well as optimization-based and iterative-update-based methods.
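The synchronization step described above can be illustrated with a toy sketch. It is not the authors' implementation: it assumes a 1D canonical space, instance spaces that are overlapping windows of it, a variance-preserving noise schedule, and a deterministic DDIM-style update. The names `tweedie_x0`, `average_in_canonical`, `synced_step`, and `eps_fn` are hypothetical and chosen only for illustration.

```python
import numpy as np

def tweedie_x0(x_t, eps_pred, alpha_bar_t):
    """Tweedie-style estimate of the clean sample from a noisy one
    (assumes x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps)."""
    return (x_t - np.sqrt(1.0 - alpha_bar_t) * eps_pred) / np.sqrt(alpha_bar_t)

def average_in_canonical(x0_views, starts, canonical_len):
    """Unproject per-view estimates into the canonical space and average overlaps.
    Assumption: each view is a contiguous window starting at starts[i]."""
    total = np.zeros(canonical_len)
    count = np.zeros(canonical_len)
    for x0, s in zip(x0_views, starts):
        total[s:s + len(x0)] += x0
        count[s:s + len(x0)] += 1
    return total / np.maximum(count, 1)

def synced_step(views, starts, canonical_len, eps_fn, alpha_bar_t, alpha_bar_prev):
    """One denoising step in the instance spaces, synchronized through
    the averaged Tweedie estimates. eps_fn stands in for a noise predictor."""
    # 1) Per-view noise prediction and clean-sample estimate.
    x0_views = [tweedie_x0(v, eps_fn(v), alpha_bar_t) for v in views]
    # 2) Synchronize the estimates in the canonical space.
    canonical = average_in_canonical(x0_views, starts, canonical_len)
    # 3) Project back and take a deterministic DDIM-style step per view.
    new_views = []
    for v, s in zip(views, starts):
        x0_sync = canonical[s:s + len(v)]
        eps_sync = (v - np.sqrt(alpha_bar_t) * x0_sync) / np.sqrt(1.0 - alpha_bar_t)
        new_views.append(np.sqrt(alpha_bar_prev) * x0_sync
                         + np.sqrt(1.0 - alpha_bar_prev) * eps_sync)
    return new_views, canonical
```

In this sketch the noisy latents stay in each instance space, and only the clean-sample estimates are averaged, so overlapping regions are pulled toward agreement at every step.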

Jaihoon Kim, Juil Koo, Kyeongmin Yeo, Minhyuk Sung • 2024

Related benchmarks

Task | Dataset | Result | Rank
3D mesh texturing | 3D mesh texturing (test) | KID 186.6 | 4
Ambiguous Image Generation | DeepFloyd-IF | KID 215.1 | 4
Mask-based Text-to-Image Generation | Mask-based T2I generation | KID 117.4 | 4
Wide image generation | Wide Image Generation 2048 x 512 | KID 51.024 | 3
