Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Spherical CNNs

About

Convolutional Neural Networks (CNNs) have become the method of choice for learning problems involving 2D planar images. However, a number of problems of recent interest have created a demand for models that can analyze spherical images. Examples include omnidirectional vision for drones, robots, and autonomous cars, molecular regression problems, and global weather and climate modelling. A naive application of convolutional networks to a planar projection of the spherical signal is destined to fail, because the space-varying distortions introduced by such a projection will make translational weight sharing ineffective. In this paper we introduce the building blocks for constructing spherical CNNs. We propose a definition for the spherical cross-correlation that is both expressive and rotation-equivariant. The spherical correlation satisfies a generalized Fourier theorem, which allows us to compute it efficiently using a generalized (non-commutative) Fast Fourier Transform (FFT) algorithm. We demonstrate the computational efficiency, numerical accuracy, and effectiveness of spherical CNNs applied to 3D model recognition and atomization energy regression.

Taco S. Cohen, Mario Geiger, Jonas Koehler, Max Welling• 2018

Related benchmarks

TaskDatasetResultRank
3D Object ClassificationModelNet40
Accuracy0.85
62
Shape RetrievalShapeNetCore55 SHREC2017 (test)
Precision (P)70.1
25
ClassificationSpherical MNIST rotated level-4 mesh (train and test (R/R))
Accuracy95
16
Panoramic classificationSPH-MNIST synthetic (test)
Error Rate6.97
15
Image ClassificationSpherical MNIST NR/NR
Accuracy96
12
Panoramic classificationSPH-CIFAR10 (test)
Accuracy10
10
Image ClassificationSpherical MNIST NR/R
Accuracy93.4
5
Digit RecognitionSpherical MNIST (test)
Accuracy96
3
Showing 8 of 8 rows

Other info

Follow for update