Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

3D Semantic Segmentation with Submanifold Sparse Convolutional Networks

About

Convolutional networks are the de-facto standard for analyzing spatio-temporal data such as images, videos, and 3D shapes. Whilst some of this data is naturally dense (e.g., photos), many other data sources are inherently sparse. Examples include 3D point clouds that were obtained using a LiDAR scanner or RGB-D camera. Standard "dense" implementations of convolutional networks are very inefficient when applied on such sparse data. We introduce new sparse convolutional operations that are designed to process spatially-sparse data more efficiently, and use them to develop spatially-sparse convolutional networks. We demonstrate the strong performance of the resulting models, called submanifold sparse convolutional networks (SSCNs), on two tasks involving semantic segmentation of 3D point clouds. In particular, our models outperform all prior state-of-the-art on the test set of a recent semantic segmentation competition.

Benjamin Graham, Martin Engelcke, Laurens van der Maaten• 2017

Related benchmarks

TaskDatasetResultRank
Semantic segmentationS3DIS (Area 5)
mIOU66.9
799
Semantic segmentationSemanticKITTI (test)
mIoU61.8
335
Semantic segmentationS3DIS (6-fold)
mIoU (Mean IoU)72.6
315
Part SegmentationShapeNetPart (test)
mIoU (Inst.)86
312
Semantic segmentationScanNet V2 (val)
mIoU72.1
288
Semantic segmentationScanNet v2 (test)
mIoU72.5
248
Semantic segmentationScanNet (val)
mIoU72.2
231
3D Shape ClassificationModelNet40 (test)
Accuracy91.6
227
3D Semantic SegmentationScanNet V2 (val)
mIoU72.9
171
LiDAR Semantic SegmentationnuScenes (val)
mIoU75.2
169
Showing 10 of 47 rows

Other info

Code

Follow for update