Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning SO(3) Equivariant Representations with Spherical CNNs

About

We address the problem of 3D rotation equivariance in convolutional neural networks. 3D rotations have been a challenging nuisance in 3D classification tasks requiring higher capacity and extended data augmentation in order to tackle it. We model 3D data with multi-valued spherical functions and we propose a novel spherical convolutional network that implements exact convolutions on the sphere by realizing them in the spherical harmonic domain. Resulting filters have local symmetry and are localized by enforcing smooth spectra. We apply a novel pooling on the spectral domain and our operations are independent of the underlying spherical resolution throughout the network. We show that networks with much lower capacity and without requiring data augmentation can exhibit performance comparable to the state of the art in standard retrieval and classification benchmarks.

Carlos Esteves, Christine Allen-Blanchette, Ameesh Makadia, Kostas Daniilidis• 2017

Related benchmarks

TaskDatasetResultRank
3D Point Cloud ClassificationModelNet40 (test)
OA88.9
297
Semantic segmentationStanford2D3DS (3-fold cross-validation)
mIoU40.2
90
3D Object ClassificationModelNet40
Accuracy0.889
62
ClassificationSpherical MNIST rotated level-4 mesh (train and test (R/R))
Accuracy98.71
16
3D Object ClassificationModelNet40 rotated (test)
Accuracy88.4
15
Image ClassificationSpherical MNIST NR/NR
Accuracy98.75
12
Shape classificationModelNet40 rotated (test)
Accuracy0.869
9
Image ClassificationSpherical MNIST NR/R
Accuracy98.08
5
3D Object ClassificationModelNet40 upright (test)
Accuracy0.893
5
Shape classificationModelNet40 level 5 resolution (test)
Accuracy88.9
4
Showing 10 of 10 rows

Other info

Follow for update