Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach

About

Human poses that are rare or unseen in a training set are challenging for a network to predict. Similar to the long-tailed distribution problem in visual recognition, the small number of examples for such poses limits the ability of networks to model them. Interestingly, local pose distributions suffer less from the long-tail problem, i.e., local joint configurations within a rare pose may appear within other poses in the training set, making them less rare. We propose to take advantage of this fact for better generalization to rare and unseen poses. To be specific, our method splits the body into local regions and processes them in separate network branches, utilizing the property that a joint position depends mainly on the joints within its local body region. Global coherence is maintained by recombining the global context from the rest of the body into each branch as a low-dimensional vector. With the reduced dimensionality of less relevant body areas, the training set distribution within network branches more closely reflects the statistics of local poses instead of global body poses, without sacrificing information important for joint inference. The proposed split-and-recombine approach, called SRNet, can be easily adapted to both single-image and temporal models, and it leads to appreciable improvements in the prediction of rare and unseen poses.

Ailing Zeng, Xiao Sun, Fuyang Huang, Minhao Liu, Qiang Xu, Stephen Lin• 2020

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationMPI-INF-3DHP (test)
PCK82.1
559
3D Human Pose EstimationHuman3.6M (test)
MPJPE (Average)32
547
3D Human Pose EstimationHuman3.6M (Protocol #1)
MPJPE (Avg.)32
440
3D Human Pose EstimationHuman3.6M (Protocol 2)
Average MPJPE36.6
315
3D Human Pose EstimationHuman3.6M Protocol 1 (test)
Dir. Error (Protocol 1)31.2
183
3D Human Pose EstimationHuman3.6M (subjects 9 and 11)
Average Error32
180
3D Human Pose EstimationHuman3.6M
MPJPE32
160
3D Human Pose EstimationMPI-INF-3DHP
PCK77.6
108
3D Human Pose EstimationHuman3.6M (S9, S11)
Average Error (MPJPE Avg)44.8
94
3D Human Pose EstimationHuman3.6M S9 and S11 (test)
Dir. Error35.8
72
Showing 10 of 17 rows

Other info

Follow for update