Convolutional Sequence to Sequence Model for Human Dynamics
About
Human motion modeling is a classic problem in computer vision and graphics. Challenges in modeling human motion include high dimensional prediction as well as extremely complicated dynamics.We present a novel approach to human motion modeling based on convolutional neural networks (CNN). The hierarchical structure of CNN makes it capable of capturing both spatial and temporal correlations effectively. In our proposed approach,a convolutional long-term encoder is used to encode the whole given motion sequence into a long-term hidden variable, which is used with a decoder to predict the remainder of the sequence. The decoder itself also has an encoder-decoder structure, in which the short-term encoder encodes a shorter sequence to a short-term hidden variable, and the spatial decoder maps the long and short-term hidden variable to motion predictions. By using such a model, we are able to capture both invariant and dynamic information of human motion, which results in more accurate predictions. Experiments show that our algorithm outperforms the state-of-the-art methods on the Human3.6M and CMU Motion Capture datasets. Our code is available at the project website.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Human Motion Prediction | Human3.6M (test) | MPJPE16.6 | 85 | |
| Long-term Human Motion Prediction | Human3.6M | Average Error (MPJPE)90.7 | 58 | |
| Human Motion Prediction | Human3.6M | MAE (1000ms)0.92 | 46 | |
| 3D Human Motion Prediction | 3DPW (test) | MPJPE (mm)18.8 | 40 | |
| Short-term motion prediction | Human 3.6M short-term motion prediction (test) | Avg MAE (Walking)17.1 | 40 | |
| 3D joint position forecasting | Human3.6M | Walking Error17.1 | 40 | |
| Human Pose Forecasting | AMASS BMLrub (test) | MPJPE (mm)20.6 | 40 | |
| Human Motion Prediction | Human3.6M (short-term) | -- | 40 | |
| Human Motion Prediction | 3DPW | Trajectory Error (400ms)1.85 | 27 | |
| Human Pose Prediction | Human 3.6M Subject 5 (test) | Walking Error (Avg)0.33 | 24 |