Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GFPose: Learning 3D Human Pose Prior with Gradient Fields

About

Learning 3D human pose prior is essential to human-centered AI. Here, we present GFPose, a versatile framework to model plausible 3D human poses for various applications. At the core of GFPose is a time-dependent score network, which estimates the gradient on each body joint and progressively denoises the perturbed 3D human pose to match a given task specification. During the denoising process, GFPose implicitly incorporates pose priors in gradients and unifies various discriminative and generative tasks in an elegant framework. Despite the simplicity, GFPose demonstrates great potential in several downstream tasks. Our experiments empirically show that 1) as a multi-hypothesis pose estimator, GFPose outperforms existing SOTAs by 20% on Human3.6M dataset. 2) as a single-hypothesis pose estimator, GFPose achieves comparable results to deterministic SOTAs, even with a vanilla backbone. 3) GFPose is able to produce diverse and realistic samples in pose denoising, completion and generation tasks. Project page https://sites.google.com/view/gfpose/

Hai Ci, Mingdong Wu, Wentao Zhu, Xiaoxuan Ma, Hao Dong, Fangwei Zhong, Yizhou Wang• 2022

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationMPI-INF-3DHP (test)
PCK86.9
559
3D Human Pose EstimationHuman3.6M (test)
MPJPE (Average)38.4
547
3D Human Pose EstimationHuman3.6M (Protocol #1)
MPJPE (Avg.)45.1
440
3D Human Pose EstimationHuman3.6M (S9, S11)
Average Error (MPJPE Avg)45.1
94
3D Pose Estimation3DHP--
25
3D Human Pose EstimationHuman3.6M Standard Protocol
MPJPE35.8
19
3D Human Pose EstimationHuman 3.6M Subjects 9 & 11 (test)
MPJPE35.6
16
3D Human Pose EstimationMPI-INF-3DHP sampled 2929 frame (test)
MPJPE86.9
15
Pose DenoisingAMASS (test)
Delta Q + m2m0.359
10
Pose GenerationAMASS (test)
FID1.246
8
Showing 10 of 12 rows

Other info

Code

Follow for update