Orientation-aware Semantic Segmentation on Icosahedron Spheres
About
We address semantic segmentation on omnidirectional images, to leverage a holistic understanding of the surrounding scene for applications like autonomous driving systems. For the spherical domain, several methods recently adopt an icosahedron mesh, but systems are typically rotation invariant or require significant memory and parameters, thus enabling execution only at very low resolutions. In our work, we propose an orientation-aware CNN framework for the icosahedron mesh. Our representation allows for fast network operations, as our design simplifies to standard network operations of classical CNNs, but under consideration of north-aligned kernel convolutions for features on the sphere. We implement our representation and demonstrate its memory efficiency up-to a level-8 resolution mesh (equivalent to 640 x 1024 equirectangular images). Finally, since our kernels operate on the tangent of the sphere, standard feature weights, pretrained on perspective data, can be directly transferred with only small need for weight refinement. In our evaluation our orientation-aware CNN becomes a new state of the art for the recent 2D3DS dataset, and our Omni-SYNTHIA version of SYNTHIA. Rotation invariant classification and segmentation tasks are additionally presented for comparison to prior art.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | Stanford2D3DS (3-fold cross-validation) | mIoU43.3 | 90 | |
| Semantic segmentation | Stanford2D3D Panoramic 1.0 (Fold-1) | mIoU43.3 | 43 | |
| Semantic segmentation | Stanford2D3D-Panoramic (SPan) v1 (averaged by 3 folds) | mIoU43.3 | 39 | |
| Classification | Spherical MNIST rotated level-4 mesh (train and test (R/R)) | Accuracy97.05 | 16 | |
| Image Classification | Spherical MNIST NR/NR | Accuracy99.45 | 12 | |
| Semantic segmentation | OmniSYNTHIA | mAcc57.1 | 12 | |
| Depth Estimation | Stanford2D3D sphere rank 7 256x512 (test) | MAE0.201 | 7 | |
| Semantic segmentation | Stanford2D3D sphere rank 7 256x512 (test) | Accuracy81.7 | 7 | |
| Classification | Spherical MNIST non-rotated level-4 mesh (train test) | Accuracy99.45 | 3 | |
| Classification | Spherical MNIST non-rotated train and rotated test (N/R) level-4 mesh | Accuracy29.84 | 3 |