Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection

About

While the voxel-based methods have achieved promising results for multi-person 3D pose estimation from multi-cameras, they suffer from heavy computation burdens, especially for large scenes. We present Faster VoxelPose to address the challenge by re-projecting the feature volume to the three two-dimensional coordinate planes and estimating X, Y, Z coordinates from them separately. To that end, we first localize each person by a 3D bounding box by estimating a 2D box and its height based on the volume features projected to the xy-plane and z-axis, respectively. Then for each person, we estimate partial joint coordinates from the three coordinate planes separately which are then fused to obtain the final 3D pose. The method is free from costly 3D-CNNs and improves the speed of VoxelPose by ten times and meanwhile achieves competitive accuracy as the state-of-the-art methods, proving its potential in real-time applications.

Hang Ye, Wentao Zhu, Chunyu Wang, Rujie Wu, Yizhou Wang• 2022

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationCampus (test)
Actor 1 Score96.5
66
3D Human Pose EstimationCampus
PCP96.2
36
3D Pose Estimationshelf
PCP Actor 199.4
25
Multi-person 3D Human Pose EstimationCMU Panoptic (test)
MPJPE (Average)18.26
22
3D Multi-person Pose EstimationMVOR 23 (test)
MPJPE (mm)109
16
3D Human Pose EstimationHuman3.6M (S9)
PCP91.3
14
3D Human Pose EstimationCMU Panoptic JLT+15 (test)
MPJPE18.26
14
Multi-person 3D Pose EstimationShelf (transfer)
PCP99.1
13
3D Multi-person Pose EstimationPanoptic (test)
PCP99.4
12
3D Multi-person Pose EstimationHuman3.6M, Shelf, Campus, and MVOR Averaged Generalization
PCP73.5
12
Showing 10 of 17 rows

Other info

Follow for update