Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OctNet: Learning Deep 3D Representations at High Resolutions

About

We present OctNet, a representation for deep learning with sparse 3D data. In contrast to existing models, our representation enables 3D convolutional networks which are both deep and high resolution. Towards this goal, we exploit the sparsity in the input data to hierarchically partition the space using a set of unbalanced octrees where each leaf node stores a pooled feature representation. This allows to focus memory allocation and computation to the relevant dense regions and enables deeper networks without compromising resolution. We demonstrate the utility of our OctNet representation by analyzing the impact of resolution on several 3D tasks including 3D object classification, orientation estimation and point cloud labeling.

Gernot Riegler, Ali Osman Ulusoy, Andreas Geiger• 2016

Related benchmarks

TaskDatasetResultRank
3D Shape ClassificationModelNet40 (test)
Accuracy86.5
227
Object ClassificationModelNet40 (test)
Accuracy86.5
180
3D shape recognitionModelNet10 (test)
Accuracy90.9
64
Semantic segmentationScanNet (test)
mIoU18.1
59
3D Object ClassificationModelNet10 (test)
Mean Class Accuracy90.1
57
Semantic segmentationS3DIS (test)
mIoU26.3
47
Object ClassificationModelNet40
Instance Accuracy0.865
33
Semantic segmentationRueMonge 2014
mIoU59.2
12
Semantic segmentationSemantic3D full (test)
Overall Accuracy80.7
9
3D Semantic SegmentationRueMonge Varcity TASK3 2014
mIoU59.2
8
Showing 10 of 13 rows

Other info

Follow for update