Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Pseudo-Spherical Contrastive Divergence

About

Energy-based models (EBMs) offer flexible distribution parametrization. However, due to the intractable partition function, they are typically trained via contrastive divergence for maximum likelihood estimation. In this paper, we propose pseudo-spherical contrastive divergence (PS-CD) to generalize maximum likelihood learning of EBMs. PS-CD is derived from the maximization of a family of strictly proper homogeneous scoring rules, which avoids the computation of the intractable partition function and provides a generalized family of learning objectives that include contrastive divergence as a special case. Moreover, PS-CD allows us to flexibly choose various learning objectives to train EBMs without additional computational cost or variational minimax optimization. Theoretical analysis on the proposed method and extensive experiments on both synthetic data and commonly used image datasets demonstrate the effectiveness and modeling flexibility of PS-CD, as well as its robustness to data contamination, thus showing its superiority over maximum likelihood and $f$-EBMs.

Lantao Yu, Jiaming Song, Yang Song, Stefano Ermon• 2021

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionCIFAR-10 (ID) vs Celeb-A (OOD)
AUROC58
55
OOD DetectionTextures (OOD)
AUROC44
38
Image GenerationCIFAR-10 (32 x 32) Conditional (test)
FID27.95
13
Image GenerationCelebA 64x64 Unconditional (test)
FID20.35
11
OOD DetectionSVHN (OOD)
AUROC0.56
4
OOD DetectionCIFAR-10 Interpolation (OOD)
AUROC68
4
OOD DetectionUniform/Gaussian Synthetic (OOD)
AUROC100
4
Showing 7 of 7 rows

Other info

Follow for update