PointMixer: MLP-Mixer for Point Cloud Understanding

About

MLP-Mixer has newly appeared as a new challenger against the realm of CNNs and transformer. Despite its simplicity compared to transformer, the concept of channel-mixing MLPs and token-mixing MLPs achieves noticeable performance in visual recognition tasks. Unlike images, point clouds are inherently sparse, unordered and irregular, which limits the direct use of MLP-Mixer for point cloud understanding. In this paper, we propose PointMixer, a universal point set operator that facilitates information sharing among unstructured 3D points. By simply replacing token-mixing MLPs with a softmax function, PointMixer can "mix" features within/between point sets. By doing so, PointMixer can be broadly used in the network as inter-set mixing, intra-set mixing, and pyramid mixing. Extensive experiments show the competitive or superior performance of PointMixer in semantic segmentation, classification, and point reconstruction against transformer-based methods.

Jaesung Choe, Chunghyun Park, Francois Rameau, Jaesik Park, In So Kweon• 2021

Related benchmarks

Task	Dataset	Result
Semantic segmentation	S3DIS (Area 5)	mIOU71.4	1006
Shape classification	ModelNet40 (test)	OA93.6	255
Mesh Saliency Prediction	Proposed Dataset Textured Mesh 1.0 (test)	CC0.5261	45
Mesh Saliency Prediction	Proposed Dataset Non-textured 1.0 (test)	CC0.5137	45
3D Semantic Segmentation	S3DIS Area 5 (test)	mIoU (%)71.4	32
Saliency Prediction	SAL3D (test)	CC0.467	15
Point Cloud Reconstruction	ShapeNet Part 87 (test)	Chamfer Distance1.11	5
Point Cloud Reconstruction	ScanNet 9 (test)	CD2.74	5
Point Cloud Reconstruction	ICL-NUIM 20 (test)	CD2.43	5

Showing 9 of 9 rows

Other info

Code

Follow for update

@wizwand_team Discord