Point2Sequence: Learning the Shape Representation of 3D Point Clouds with an Attention-based Sequence to Sequence Network
About
Exploring contextual information in the local region is important for shape understanding and analysis. Existing studies often employ hand-crafted or explicit ways to encode contextual information of local regions. However, it is hard to capture fine-grained contextual information in hand-crafted or explicit manners, such as the correlation between different areas in a local region, which limits the discriminative ability of learned features. To resolve this issue, we propose a novel deep learning model for 3D point clouds, named Point2Sequence, to learn 3D shape features by capturing fine-grained contextual information in a novel implicit way. Point2Sequence employs a novel sequence learning model for point clouds to capture the correlations by aggregating multi-scale areas of each local region with attention. Specifically, Point2Sequence first learns the feature of each area scale in a local region. Then, it captures the correlation between area scales in the process of aggregating all area scales using a recurrent neural network (RNN) based encoder-decoder structure, where an attention mechanism is proposed to highlight the importance of different area scales. Experimental results show that Point2Sequence achieves state-of-the-art performance in shape classification and segmentation tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Part Segmentation | ShapeNetPart (test) | mIoU (Inst.)85.2 | 312 | |
| 3D Object Classification | ModelNet40 (test) | Accuracy92.6 | 302 | |
| 3D Point Cloud Classification | ModelNet40 (test) | OA92.6 | 297 | |
| Shape classification | ModelNet40 (test) | OA92.6 | 255 | |
| 3D Shape Classification | ModelNet40 (test) | Accuracy92.6 | 227 | |
| Point Cloud Classification | ModelNet40 (test) | Accuracy92.6 | 224 | |
| Part Segmentation | ShapeNetPart | mIoU (Instance)85.2 | 198 | |
| Object Classification | ModelNet40 (test) | -- | 180 | |
| 3D Object Part Segmentation | ShapeNet Part (test) | mIoU85.2 | 114 | |
| Shape classification | ModelNet40 | Accuracy92.6 | 85 |