Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
About
We address the task of 6D multi-object pose: given a set of known 3D objects and an RGB or RGB-D input image, we detect and estimate the 6D pose of each object. We propose a new approach to 6D object pose estimation which consists of an end-to-end differentiable architecture that makes use of geometric knowledge. Our approach iteratively refines both pose and correspondence in a tightly coupled manner, allowing us to dynamically remove outliers to improve accuracy. We use a novel differentiable layer to perform pose refinement by solving an optimization problem we refer to as Bidirectional Depth-Augmented Perspective-N-Point (BD-PnP). Our method achieves state-of-the-art accuracy on standard 6D Object Pose benchmarks. Code is available at https://github.com/princeton-vl/Coupled-Iterative-Refinement.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 6DoF Pose Estimation | YCB-Video (test) | -- | 72 | |
| 6D Object Pose Estimation | BOP Core Datasets Challenge (test) | LM-O Score73.4 | 42 | |
| 6D Object Pose Estimation | BOP (T-LESS, ITODD, YCB-V, LM-O) Challenge (test) | LM-O Score65.5 | 13 | |
| 6D Pose Estimation | BOP Benchmark (test) | LM-O Score73.4 | 11 | |
| 6D Object Pose Refinement | YCB-V | Avg. Success82.4 | 9 | |
| 6D Object Pose Estimation | T-LESS (test) | AR71.5 | 6 | |
| 6D Object Pose Refinement | LM-O | Avg Error0.655 | 5 | |
| 6D Object Pose Refinement | BOP datasets (YCB-V, LM-O) (test) | Timing (ms)1.10e+4 | 5 |