Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

About

In this paper, we study an under-explored but important factor of diffusion generative models, i.e., the combinatorial complexity. Data samples are generally high-dimensional, and for various structured generation tasks, additional attributes are combined to associate with data samples. We show that the space spanned by the combination of dimensions and attributes can be insufficiently covered by existing training schemes of diffusion generative models, potentially limiting test time performance. We present a simple fix to this problem by constructing stochastic processes that fully exploit the combinatorial structures, hence the name ComboStoc. Using this simple strategy, we show that network training is significantly accelerated across diverse data modalities, including images and 3D structured shapes. Moreover, ComboStoc enables a new way of test time generation which uses asynchronous time steps for different dimensions and attributes, thus allowing for varying degrees of control over them. Our code is available at: https://github.com/Xrvitd/ComboStoc

Rui Xu, Jiepeng Wang, Hao Pan, Yang Liu, Xin Tong, Shiqing Xin, Changhe Tu, Taku Komura, Wenping Wang• 2024

Related benchmarks

TaskDatasetResultRank
Image GenerationImageNet 50k samples
FID2.14
52
3D Shape GenerationPartNet chair
FPD4.04
3
3D Shape GenerationPartNet table
FPD3.43
3
Showing 3 of 3 rows

Other info

GitHub

Follow for update