Canonical Capsules: Self-Supervised Capsules in Canonical Pose

About

We propose a self-supervised capsule architecture for 3D point clouds. We compute capsule decompositions of objects through permutation-equivariant attention, and self-supervise the process by training with pairs of randomly rotated objects. Our key idea is to aggregate the attention masks into semantic keypoints, and use these to supervise a decomposition that satisfies the capsule invariance/equivariance properties. This not only enables the training of a semantically consistent decomposition, but also allows us to learn a canonicalization operation that enables object-centric reasoning. To train our neural network we require neither classification labels nor manually-aligned training datasets. Yet, by learning an object-centric representation in a self-supervised manner, our method outperforms the state-of-the-art on 3D point cloud reconstruction, canonicalization, and unsupervised classification.

Weiwei Sun, Andrea Tagliasacchi, Boyang Deng, Sara Sabour, Soroosh Yazdani, Geoffrey Hinton, Kwang Moo Yi• 2020

Related benchmarks

Task	Dataset	Result
Full Shape Canonicalization	ShapeNet 13 categories (test)	CC (Category-Level Consistency)0.1262	12
3D registration	ShapeNet core (full shapes)	RMSE (Airplane)0.024	7
Registration	ShapeNet Airplane (full shapes) core (test)	RMSE0.024	7
Registration	ShapeNet Chair (full shapes) core (test)	RMSE0.027	7
Registration	ShapeNet Multi (full shapes) core (test)	RMSE0.07	7
3D object canonicalization	DREDS	Aeroplane IC0.785	5

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord