Isolating Sources of Disentanglement in Variational Autoencoders
About
We decompose the evidence lower bound to show the existence of a term measuring the total correlation between latent variables. We use this to motivate our $\beta$-TCVAE (Total Correlation Variational Autoencoder), a refinement of the state-of-the-art $\beta$-VAE objective for learning disentangled representations, requiring no additional hyperparameters during training. We further propose a principled classifier-free measure of disentanglement called the mutual information gap (MIG). We perform extensive quantitative and qualitative experiments, in both restricted and non-restricted settings, and show a strong relation between total correlation and disentanglement, when the latent variables model is trained using our framework.
Ricky T. Q. Chen, Xuechen Li, Roger Grosse, David Duvenaud• 2018
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time Series Forecasting | ETTh1 | MSE0.418 | 601 | |
| Image Classification | STL-10 (test) | Accuracy27.35 | 357 | |
| Time Series Forecasting | ETTm1 | MSE1.438 | 334 | |
| Time Series Forecasting | ETTh1 (test) | MSE0.681 | 262 | |
| Time Series Forecasting | ETTm1 (test) | MSE1.538 | 196 | |
| Time Series Forecasting | Traffic (test) | MSE0.997 | 192 | |
| Time Series Forecasting | Traffic | MSE0.903 | 145 | |
| Time Series Forecasting | Weather (test) | MSE0.212 | 110 | |
| Time Series Forecasting | Electricity (test) | MSE0.85 | 72 | |
| Image Reconstruction | CelebA-HQ (test) | FID (Reconstruction)80.33 | 50 |
Showing 10 of 60 rows