Multi-VAE: Learning Disentangled View-common and View-peculiar Visual Representations for Multi-view Clustering
About
Multi-view clustering, a long-standing and important research problem, focuses on mining complementary information from diverse views. However, existing works often fuse multiple views' representations or handle clustering in a common feature space, which may result in their entanglement especially for visual representations. To address this issue, we present a novel VAE-based multi-view clustering framework (Multi-VAE) by learning disentangled visual representations. Concretely, we define a view-common variable and multiple view-peculiar variables in the generative model. The prior of view-common variable obeys approximately discrete Gumbel Softmax distribution, which is introduced to extract the common cluster factor of multiple views. Meanwhile, the prior of view-peculiar variable follows continuous Gaussian distribution, which is used to represent each view's peculiar visual factors. By controlling the mutual information capacity to disentangle the view-common and view-peculiar representations, continuous visual information of multiple views can be separated so that their common discrete cluster information can be effectively mined. Experimental results demonstrate that Multi-VAE enjoys the disentangled and explainable visual representations, while obtaining superior clustering performance compared with state-of-the-art methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Clustering | COIL-20 | ACC65.77 | 47 | |
| Clustering | COIL-100 | ACC48.87 | 28 | |
| Clustering | E-MNIST | Accuracy60.74 | 25 | |
| Clustering | E-FMNIST | ACC (Clustering)53.16 | 13 | |
| Clustering | Office-31 | ACC_clu31.27 | 13 | |
| Classification | Office-31 (test) | Accuracy (cls)62.53 | 11 | |
| Classification | COIL-20 (test) | Classification Accuracy (ACCcls)90.39 | 11 | |
| Classification | E-FMNIST (test) | Accuracy81.54 | 11 | |
| Classification | E-MNIST (test) | Accuracy92.73 | 11 | |
| Classification | COIL-100 (test) | Accuracy74.11 | 11 |