Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo
About
Existing approaches for multi-view multi-person 3D pose estimation explicitly establish cross-view correspondences to group 2D pose detections from multiple camera views and solve for the 3D pose estimation for each person. Establishing cross-view correspondences is challenging in multi-person scenes, and incorrect correspondences will lead to sub-optimal performance for the multi-stage pipeline. In this work, we present our multi-view 3D pose estimation approach based on plane sweep stereo to jointly address the cross-view fusion and 3D pose reconstruction in a single shot. Specifically, we propose to perform depth regression for each joint of each 2D pose in a target camera view. Cross-view consistency constraints are implicitly enforced by multiple reference camera views via the plane sweep algorithm to facilitate accurate depth regression. We adopt a coarse-to-fine scheme to first regress the person-level depth followed by a per-person joint-level relative depth estimation. 3D poses are obtained from a simple back-projection given the estimated depths. We evaluate our approach on benchmark datasets where it outperforms previous state-of-the-arts while being remarkably efficient. Our code is available at https://github.com/jiahaoLjh/PlaneSweepPose.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Pose Estimation | Campus (test) | Actor 1 Score98.4 | 66 | |
| 3D Human Pose Estimation | Shelf (test) | Actor 1 Score99.3 | 27 | |
| Multi-view multi-person 3D pose estimation | Campus | PCP (Actor 1)98.4 | 26 | |
| Multi-person 3D Human Pose Estimation | CMU Panoptic (test) | MPJPE (Average)16.75 | 22 | |
| 3D Human Pose Estimation | CMU Panoptic JLT+15 (test) | MPJPE16.75 | 14 | |
| 3D Multi-person Pose Estimation (In-domain) | Shelf 2 (test) | PCP97.9 | 12 | |
| 3D Multi-person Pose Estimation (In-domain) | Campus 2 (test) | PCP97 | 11 | |
| Multi-view multi-person 3D pose estimation | Shelf (test) | PCP3D (Actor 1)99.3 | 10 | |
| Multi-person 3D Pose Estimation | Panoptic | MPJPE (mm)16.3 | 10 | |
| Multi-view multi-person 3D pose estimation | shelf | PCP (Actor 1)99.3 | 8 |