Deep Closest Point: Learning Representations for Point Cloud Registration

About

Point cloud registration is a key problem for computer vision applied to robotics, medical imaging, and other applications. This problem involves finding a rigid transformation from one point cloud into another so that they align. Iterative Closest Point (ICP) and its variants provide simple and easily-implemented iterative methods for this task, but these algorithms can converge to spurious local optima. To address local optima and other difficulties in the ICP pipeline, we propose a learning-based method, titled Deep Closest Point (DCP), inspired by recent techniques in computer vision and natural language processing. Our model consists of three parts: a point cloud embedding network, an attention-based module combined with a pointer generation layer, to approximate combinatorial matching, and a differentiable singular value decomposition (SVD) layer to extract the final rigid transformation. We train our model end-to-end on the ModelNet40 dataset and show in several settings that it performs better than ICP, its variants (e.g., Go-ICP, FGR), and the recently-proposed learning-based method PointNetLK. Beyond providing a state-of-the-art registration technique, we evaluate the suitability of our learned features transferred to unseen objects. We also provide preliminary analysis of our learned model to help understand whether domain-specific and/or global features facilitate rigid registration.

Yue Wang, Justin M. Solomon• 2019

Related benchmarks

Task	Dataset	Result
Point cloud registration	3DMatch (test)	Registration Recall3.2	393
6D Object Pose Estimation	LineMOD	--	60
3D Point Cloud Registration	3DMatch	Translation Error (cm)21.4	44
Point cloud registration	ModelNet40 (Unseen categories)	RMSE (Rotation)3.1502	36
Point cloud registration	ModelNet 40 (test)	RRE11.975	33
Point cloud registration	ModelNet40 RPMNet manner (Unseen Shapes)	RMSE(R)4.291	32
Point cloud registration	ModelNet40 twice-sampled (TS) unseen categories (test)	RMSE (Rotation)6.754	30
6D Object Pose Estimation	Occlusion LINEMOD	--	27
Point cloud registration	nuScenes	RRE (°)2.07	25
Point cloud registration	ModelNet40 (test)	Inference Time (s)0.0032	24

Showing 10 of 42 rows

Other info

Code

Follow for update

@wizwand_team Discord