Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations

About

We would like to learn a representation of the data which decomposes an observation into factors of variation which we can independently control. Specifically, we want to use minimal supervision to learn a latent representation that reflects the semantics behind a specific grouping of the data, where within a group the samples share a common factor of variation. For example, consider a collection of face images grouped by identity. We wish to anchor the semantics of the grouping into a relevant and disentangled representation that we can easily exploit. However, existing deep probabilistic models often assume that the observations are independent and identically distributed. We present the Multi-Level Variational Autoencoder (ML-VAE), a new deep probabilistic model for learning a disentangled representation of a set of grouped observations. The ML-VAE separates the latent representation into semantically meaningful parts by working both at the group level and the observation level, while retaining efficient test-time inference. Quantitative and qualitative evaluations show that the ML-VAE model (i) learns a semantically meaningful disentanglement of grouped data, (ii) enables manipulation of the latent representation, and (iii) generalises to unseen groups.

Diane Bouchacourt, Ryota Tomioka, Sebastian Nowozin• 2017

Related benchmarks

TaskDatasetResultRank
FoV regressionCars3D (all)
R2 Score0.989
55
Disentangled Representation LearningCars3D
FactorVAE0.87
35
DisentanglementShapes3D--
18
Abstract Visual ReasoningAbstract Visual Reasoning WReN (10^2 samples)
Accuracy17.7
15
DisentanglementMPI3D
BetaVAE Score0.703
13
DisentanglementShapes3D
BetaVAE Score0.976
13
Pose EstimationPascal3D+ chair (test)
Median Angular Error (°)80.6
12
Viewpoint EstimationPascal3D+ Car (test)
Median Error75.6
12
Pose EstimationSynthetic domain cars (unseen instances)
Med. Error (°)9.3
4
Showing 9 of 9 rows

Other info

Follow for update