Spherical CNNs
About
Convolutional Neural Networks (CNNs) have become the method of choice for learning problems involving 2D planar images. However, a number of problems of recent interest have created a demand for models that can analyze spherical images. Examples include omnidirectional vision for drones, robots, and autonomous cars, molecular regression problems, and global weather and climate modelling. A naive application of convolutional networks to a planar projection of the spherical signal is destined to fail, because the space-varying distortions introduced by such a projection will make translational weight sharing ineffective. In this paper we introduce the building blocks for constructing spherical CNNs. We propose a definition for the spherical cross-correlation that is both expressive and rotation-equivariant. The spherical correlation satisfies a generalized Fourier theorem, which allows us to compute it efficiently using a generalized (non-commutative) Fast Fourier Transform (FFT) algorithm. We demonstrate the computational efficiency, numerical accuracy, and effectiveness of spherical CNNs applied to 3D model recognition and atomization energy regression.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Classification | ModelNet40 | Accuracy0.85 | 62 | |
| Shape Retrieval | ShapeNetCore55 SHREC2017 (test) | Precision (P)70.1 | 25 | |
| Classification | Spherical MNIST rotated level-4 mesh (train and test (R/R)) | Accuracy95 | 16 | |
| Panoramic classification | SPH-MNIST synthetic (test) | Error Rate6.97 | 15 | |
| Image Classification | Spherical MNIST NR/NR | Accuracy96 | 12 | |
| Panoramic classification | SPH-CIFAR10 (test) | Accuracy10 | 10 | |
| Image Classification | Spherical MNIST NR/R | Accuracy93.4 | 5 | |
| Digit Recognition | Spherical MNIST (test) | Accuracy96 | 3 |