PointMixer: MLP-Mixer for Point Cloud Understanding
About
MLP-Mixer has newly appeared as a new challenger against the realm of CNNs and transformer. Despite its simplicity compared to transformer, the concept of channel-mixing MLPs and token-mixing MLPs achieves noticeable performance in visual recognition tasks. Unlike images, point clouds are inherently sparse, unordered and irregular, which limits the direct use of MLP-Mixer for point cloud understanding. In this paper, we propose PointMixer, a universal point set operator that facilitates information sharing among unstructured 3D points. By simply replacing token-mixing MLPs with a softmax function, PointMixer can "mix" features within/between point sets. By doing so, PointMixer can be broadly used in the network as inter-set mixing, intra-set mixing, and pyramid mixing. Extensive experiments show the competitive or superior performance of PointMixer in semantic segmentation, classification, and point reconstruction against transformer-based methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | S3DIS (Area 5) | mIOU71.4 | 799 | |
| Shape classification | ModelNet40 (test) | OA93.6 | 255 | |
| Mesh Saliency Prediction | Proposed Dataset Textured Mesh 1.0 (test) | CC0.5261 | 45 | |
| Mesh Saliency Prediction | Proposed Dataset Non-textured 1.0 (test) | CC0.5137 | 45 | |
| 3D Semantic Segmentation | S3DIS Area 5 (test) | mIoU (%)71.4 | 32 | |
| Saliency Prediction | SAL3D (test) | CC0.467 | 15 | |
| Point Cloud Reconstruction | ShapeNet Part 87 (test) | Chamfer Distance1.11 | 5 | |
| Point Cloud Reconstruction | ScanNet 9 (test) | CD2.74 | 5 | |
| Point Cloud Reconstruction | ICL-NUIM 20 (test) | CD2.43 | 5 |