Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Unified Query-based Paradigm for Point Cloud Understanding

About

3D point cloud understanding is an important component in autonomous driving and robotics. In this paper, we present a novel Embedding-Querying paradigm (EQ- Paradigm) for 3D understanding tasks including detection, segmentation, and classification. EQ-Paradigm is a unified paradigm that enables the combination of any existing 3D backbone architectures with different task heads. Under the EQ-Paradigm, the input is firstly encoded in the embedding stage with an arbitrary feature extraction architecture, which is independent of tasks and heads. Then, the querying stage enables the encoded features to be applicable for diverse task heads. This is achieved by introducing an intermediate representation, i.e., Q-representation, in the querying stage to serve as a bridge between the embedding stage and task heads. We design a novel Q- Net as the querying stage network. Extensive experimental results on various 3D tasks, including object detection, semantic segmentation and shape classification, show that EQ-Paradigm in tandem with Q-Net is a general and effective pipeline, which enables a flexible collaboration of backbones and heads, and further boosts the performance of the state-of-the-art methods. Codes and models are available at https://github.com/dvlab-research/DeepVision3D.

Zetong Yang, Li Jiang, Yanan Sun, Bernt Schiele, Jiaya Jia• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationS3DIS (Area 5)
mIOU71.3
799
Semantic segmentationS3DIS (6-fold)
mIoU (Mean IoU)77.5
315
Semantic segmentationScanNet V2 (val)
mIoU75.3
288
Shape classificationModelNet40 (test)
OA93.2
255
3D Object DetectionKITTI car (test)
AP3D (Easy)90.31
195
3D Semantic SegmentationScanNet V2 (val)
mIoU75.3
171
3D Semantic SegmentationScanNet v2 (test)
mIoU74.3
110
Shape classificationModelNet40
Accuracy93.2
85
Object DetectionScanNet v2 (test)
AP@0.5050
70
3D Object DetectionSUN RGB-D (test)
mAP@0.2560.5
64
Showing 10 of 18 rows

Other info

Code

Follow for update