Viewpoints and Keypoints
About
We characterize the problem of pose estimation for rigid objects in terms of determining viewpoint to explain coarse pose and keypoint prediction to capture the finer details. We address both these tasks in two different settings - the constrained setting with known bounding boxes and the more challenging detection setting where the aim is to simultaneously detect and correctly estimate pose of objects. We present Convolutional Neural Network based architectures for these and demonstrate that leveraging viewpoint estimates can substantially improve local appearance based keypoint predictions. In addition to achieving significant improvements over state-of-the-art in the above tasks, we analyze the error modes and effect of object characteristics on performance to guide future efforts towards this goal.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Viewpoint Estimation | PASCAL3D+ | Aero Error Rate13.8 | 20 | |
| Rotation Prediction | PASCAL3D+ (test) | Average Rotation Error13.6 | 10 | |
| 2D Keypoint Localization | PASCAL3D+ (test) | Aero Acc66 | 6 | |
| Keypoint Localization | PASCAL VOC 2012 | Aero Acc66 | 5 | |
| Viewpoint Estimation | Pascal3D+ v1.0 (test) | Aeroplane Error0.81 | 5 | |
| Viewpoint Estimation | Pascal3D+ novel categories 44 | Motorcycle Viewpoint Accuracy58 | 4 | |
| Viewpoint Prediction | PASCAL3D+ (val) | AVP (4 bins)49.1 | 4 |