Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis

About

With growing attention to tabular data these days, the attempt to apply a synthetic table to various tasks has been expanded toward various scenarios. Owing to the recent advances in generative modeling, fake data generated by tabular data synthesis models become sophisticated and realistic. However, there still exists a difficulty in modeling discrete variables (columns) of tabular data. In this work, we propose to process continuous and discrete variables separately (but being conditioned on each other) by two diffusion models. The two diffusion models are co-evolved during training by reading conditions from each other. In order to further bind the diffusion models, moreover, we introduce a contrastive learning method with a negative sampling method. In our experiments with 11 real-world tabular datasets and 8 baseline methods, we prove the efficacy of the proposed method, called CoDi.

Chaejeong Lee, Jayoung Kim, Noseong Park• 2023

Related benchmarks

TaskDatasetResultRank
Tabular Data GenerationBeijing
DCR-0021.00e-4
20
Tabular Data Generationmagic
DCR-00251.8
20
Tabular Data GenerationNews
DCR-0020.4976
18
Tabular Data SynthesisAdult
Shape Similarity0.7662
17
Tabular Data SynthesisDiabetes
Shapes0.7868
15
Tabular Data UtilityMagic (test)
AUC0.931
14
Tabular Data UtilityCalifornia (test)
AUC0.981
14
Tabular Data UtilityAdult (test)
AUC0.829
14
Tabular Data UtilityDefault (test)
AUC0.497
14
Tabular Data UtilityShoppers (test)
AUC0.855
13
Showing 10 of 72 rows
...

Other info

Follow for update