Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MotionMixer: MLP-based 3D Human Body Pose Forecasting

About

In this work, we present MotionMixer, an efficient 3D human body pose forecasting model based solely on multi-layer perceptrons (MLPs). MotionMixer learns the spatial-temporal 3D body pose dependencies by sequentially mixing both modalities. Given a stacked sequence of 3D body poses, a spatial-MLP extracts fine grained spatial dependencies of the body joints. The interaction of the body joints over time is then modelled by a temporal MLP. The spatial-temporal mixed features are finally aggregated and decoded to obtain the future motion. To calibrate the influence of each time step in the pose sequence, we make use of squeeze-and-excitation (SE) blocks. We evaluate our approach on Human3.6M, AMASS, and 3DPW datasets using the standard evaluation protocols. For all evaluations, we demonstrate state-of-the-art performance, while having a model with a smaller number of parameters. Our code is available at: https://github.com/MotionMLP/MotionMixer

Arij Bouazizi, Adrian Holzbock, Ulrich Kressel, Klaus Dietmayer, Vasileios Belagiannis• 2022

Related benchmarks

TaskDatasetResultRank
Human Motion PredictionHuman3.6M (test)--
85
Human Motion PredictionHuman3.6M--
46
Human Motion Prediction3DPW
Trajectory Error (400ms)22.8
27
3D Human Motion PredictionHuman3.6M S5 (test)
Average MPJPE (560ms)46.1
17
3D Human Pose PredictionHuman3.6M
Avg 3D Error (160ms)13.2
16
3D Pose Forecasting (Joint Angles)Human3.6M
MAE @ 80ms0.2
15
3D Hand Pose EstimationTED Hands (test)
L2 Error2.324
14
3D Hand Gesture GenerationB2H dataset (test)
FHD2.169
8
3D hand gesture samplingTED Hands dataset (test)
FHD1.613
8
3D hand predictionB2H dataset (test)
L2 Error3.616
7
Showing 10 of 11 rows

Other info

Code

Follow for update