Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Energy-Based Modelling for Discrete and Mixed Data via Heat Equations on Structured Spaces

About

Energy-based models (EBMs) offer a flexible framework for probabilistic modelling across various data domains. However, training EBMs on data in discrete or mixed state spaces poses significant challenges due to the lack of robust and fast sampling methods. In this work, we propose to train discrete EBMs with Energy Discrepancy, a loss function which only requires the evaluation of the energy function at data points and their perturbed counterparts, thus eliminating the need for Markov chain Monte Carlo. We introduce perturbations of the data distribution by simulating a diffusion process on the discrete state space endowed with a graph structure. This allows us to inform the choice of perturbation from the structure of the modelled discrete variable, while the continuous time parameter enables fine-grained control of the perturbation. Empirically, we demonstrate the efficacy of the proposed approaches in a wide range of applications, including the estimation of discrete densities with non-binary vocabulary and binary image modelling. Finally, we train EBMs on tabular data sets with applications in synthetic data generation and calibrated classification.

Tobias Schr\"oder, Zijing Ou, Yingzhen Li, Andrew B. Duncan• 2024

Related benchmarks

TaskDatasetResultRank
Image ModelingOmniglot (test)
NLL93.94
27
Tabular Data GenerationAdult (test)--
12
Tabular Data GenerationBeijing (test)--
12
Graph generationEgo-small (test)
Degree0.063
11
Tabular Data Synthesisbank (test)
AUC0.845
10
Tabular Data SynthesisChurn (test)
AUC0.81
9
Tabular Data SynthesisAdult, Bank, Cardio, Churn, Mushroom, Beijing (test)
Avg Rank3.67
9
Tabular Data SynthesisAdult
Density Similarity (Single Column)0.828
9
Tabular Data SynthesisBank
Density Similarity (Single Column)0.894
9
Tabular Data SynthesisBeijing
Density Similarity (Single-Column)95.1
9
Showing 10 of 33 rows

Other info

Code

Follow for update