Point2Pose: Occlusion-Recovering 6D Pose Tracking and 3D Reconstruction for Multiple Unknown Objects Via 2D Point Trackers
About
We present Point2Pose, a model-free method for causal 6D pose tracking of multiple rigid objects from monocular RGB-D video. Initialized only from sparse image points on the objects to be tracked, our approach tracks multiple unseen objects without requiring object CAD models or category priors. Point2Pose leverages a 2D point tracker to obtain long-range correspondences, enabling instant recovery after complete occlusion. Simultaneously, the system incrementally reconstructs an online Truncated Signed Distance Function (TSDF) representation of the tracked targets. Alongside the method, we introduce a new multi-object tracking dataset comprising both simulation and real-world sequences, with motion-capture ground truth for evaluation. Experiments show that Point2Pose achieves performance comparable to the state-of-the-art methods on a severe-occlusion benchmark, while additionally supporting multi-object tracking and recovery from complete occlusion, capabilities that are not supported by previous model-free tracking approaches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-object 6D pose tracking | YCBMultiTrack Real-World | ADD-S96.49 | 31 | |
| 6D Object Pose Tracking | YCBMultiTrack (synthetic) | ADD-S98.2 | 18 | |
| 6D Object Pose Tracking | YCBInEOAT (test) | ADD (003 Cracker Box)96.37 | 11 | |
| 6D Pose Tracking | HO3D AP11 | ADD-S AUC96.18 | 2 | |
| 6D Pose Tracking | HO3D AP13 | ADD-S AUC96.27 | 2 | |
| 6D Pose Tracking | HO3D MPM10 | ADD-S AUC95.31 | 2 | |
| 6D Pose Tracking | HO3D MPM11 | ADD-S AUC97.08 | 2 | |
| 6D Pose Tracking | HO3D MPM12 | ADD-S AUC97.12 | 2 | |
| 6D Pose Tracking | HO3D MPM13 | ADD-S AUC96.58 | 2 | |
| 6D Pose Tracking | HO3D Mean | ADD-S AUC94.63 | 2 |