Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds

About

We propose a novel, conceptually simple and general framework for instance segmentation on 3D point clouds. Our method, called 3D-BoNet, follows the simple design philosophy of per-point multilayer perceptrons (MLPs). The framework directly regresses 3D bounding boxes for all instances in a point cloud, while simultaneously predicting a point-level mask for each instance. It consists of a backbone network followed by two parallel network branches for 1) bounding box regression and 2) point mask prediction. 3D-BoNet is single-stage, anchor-free and end-to-end trainable. Moreover, it is remarkably computationally efficient as, unlike existing approaches, it does not require any post-processing steps such as non-maximum suppression, feature sampling, clustering or voting. Extensive experiments show that our approach surpasses existing work on both ScanNet and S3DIS datasets while being approximately 10x more computationally efficient. Comprehensive ablation studies demonstrate the effectiveness of our design.

Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni• 2019

Related benchmarks

TaskDatasetResultRank
Semantic segmentationS3DIS (Area 5)
mIOU51.8
799
Semantic segmentationS3DIS (6-fold)
mIoU (Mean IoU)59.4
315
3D Instance SegmentationScanNet V2 (val)
Average AP5049
195
3D Instance SegmentationScanNet v2 (test)
mAP48.8
135
3D Instance SegmentationS3DIS (Area 5)
mAP@50% IoU57.5
106
3D Instance SegmentationS3DIS (6-fold CV)
Mean Precision @50% IoU65.6
92
3D Instance SegmentationScanNet hidden v2 (test)
Cabinet AP@0.530.1
69
Instance SegmentationS3DIS (6-fold CV)
mPrec65.6
40
Instance SegmentationS3DIS (Area 5)
mPrec57.6
22
Instance SegmentationScanNet v2 (test)
AP25.3
22
Showing 10 of 16 rows

Other info

Code

Follow for update