REGTR: End-to-end Point Cloud Correspondences with Transformers
About
Despite recent success in incorporating learning into point cloud registration, many works focus on learning feature descriptors and continue to rely on nearest-neighbor feature matching and outlier filtering through RANSAC to obtain the final set of correspondences for pose estimation. In this work, we conjecture that attention mechanisms can replace the role of explicit feature matching and RANSAC, and thus propose an end-to-end framework to directly predict the final set of correspondences. We use a network architecture consisting primarily of transformer layers containing self and cross attentions, and train it to predict the probability each point lies in the overlapping region and its corresponding position in the other point cloud. The required rigid transformation can then be estimated directly from the predicted correspondences without further post-processing. Despite its simplicity, our approach achieves state-of-the-art performance on 3DMatch and ModelNet benchmarks. Our source code can be found at https://github.com/yewzijian/RegTR .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Point cloud registration | 3DMatch (test) | Registration Recall92 | 339 | |
| Point cloud registration | 3DLoMatch (test) | Registration Recall64.8 | 287 | |
| Point cloud registration | 3DLoMatch Indoor (test) | RR64.8 | 66 | |
| Point cloud registration | 3DMatch | Registration Recall (RR)91.9 | 51 | |
| Point cloud registration | ModelNet 40 (test) | RRE1.473 | 27 | |
| Pairwise point cloud registration | 3DLoMatch | RR64.6 | 23 | |
| 3D Point Cloud Registration | 3DMatch (test) | Total Time91 | 21 | |
| Point cloud registration | ModelLoNet 40 (test) | RRE3.93 | 17 | |
| Point cloud registration | 3DMatch indoor RGBD (test) | Registration Recall (5k samples)92 | 16 | |
| Point cloud registration | 3DLoMatch indoor RGBD (test) | Recall (5k samples)64.8 | 16 |