Deep Closest Point: Learning Representations for Point Cloud Registration
About
Point cloud registration is a key problem for computer vision applied to robotics, medical imaging, and other applications. This problem involves finding a rigid transformation from one point cloud into another so that they align. Iterative Closest Point (ICP) and its variants provide simple and easily-implemented iterative methods for this task, but these algorithms can converge to spurious local optima. To address local optima and other difficulties in the ICP pipeline, we propose a learning-based method, titled Deep Closest Point (DCP), inspired by recent techniques in computer vision and natural language processing. Our model consists of three parts: a point cloud embedding network, an attention-based module combined with a pointer generation layer, to approximate combinatorial matching, and a differentiable singular value decomposition (SVD) layer to extract the final rigid transformation. We train our model end-to-end on the ModelNet40 dataset and show in several settings that it performs better than ICP, its variants (e.g., Go-ICP, FGR), and the recently-proposed learning-based method PointNetLK. Beyond providing a state-of-the-art registration technique, we evaluate the suitability of our learned features transferred to unseen objects. We also provide preliminary analysis of our learned model to help understand whether domain-specific and/or global features facilitate rigid registration.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Point cloud registration | 3DMatch (test) | Registration Recall3.2 | 339 | |
| 6D Object Pose Estimation | LineMOD | -- | 50 | |
| Point cloud registration | ModelNet40 (Unseen categories) | RMSE (Rotation)3.1502 | 36 | |
| Point cloud registration | ModelNet40 RPMNet manner (Unseen Shapes) | RMSE(R)4.291 | 32 | |
| Point cloud registration | ModelNet40 twice-sampled (TS) unseen categories (test) | RMSE (Rotation)6.754 | 30 | |
| Point cloud registration | ModelNet 40 (test) | RRE11.975 | 27 | |
| 6D Object Pose Estimation | Occlusion LINEMOD | -- | 27 | |
| Point cloud registration | nuScenes | RRE (°)2.07 | 25 | |
| Point cloud registration | ModelNet40 (test) | Inference Time (s)0.0032 | 24 | |
| Point cloud registration | ModelNet40 Unseen Categories with Gaussian Noise RPMNet manner (OS) | RMSE (Rotation)4.862 | 21 |