Iterative Greedy Matching for 3D Human Pose Tracking from Multiple Views
About
In this work we propose an approach for estimating 3D human poses of multiple people from a set of calibrated cameras. Estimating 3D human poses from multiple views has several compelling properties: human poses are estimated within a global coordinate space and multiple cameras provide an extended field of view which helps in resolving ambiguities, occlusions and motion blur. Our approach builds upon a real-time 2D multi-person pose estimation system and greedily solves the association problem between multiple views. We utilize bipartite matching to track multiple people over multiple frames. This proofs to be especially efficient as problems associated with greedy matching such as occlusion can be easily resolved in 3D. Our approach achieves state-of-the-art results on popular benchmarks and may serve as a baseline for future work.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Pose Estimation | Campus (test) | Actor 1 Score98 | 66 | |
| 3D Human Pose Estimation | Campus | PCP84.1 | 36 | |
| 3D Multi-person Pose Estimation | Shelf (test) | Actor 1 Score99.8 | 27 | |
| 3D Human Pose Estimation | Shelf (test) | Actor 1 Score99.8 | 27 | |
| 3D Human Pose Estimation | KTH Multiview Football II (Sequence 1 of Player 2) | Upper Arms PCP100 | 17 | |
| 3D Multi-person Pose Estimation | MVOR 23 (test) | MPJPE (mm)235 | 16 | |
| Multi-person 3D Pose Estimation | Campus frames 350-470, 650-750 (test) | PCP (ua)99 | 15 | |
| Multi-person 3D Pose Estimation | Shelf frames 300-600 (test) | PCP (ua)1 | 15 | |
| 3D Human Pose Estimation | Chi3D | Invalid Rate220 | 14 | |
| 3D Human Pose Estimation | Human3.6M (S9) | PCP56.1 | 14 |