Programmatic Concept Learning for Human Motion Description and Synthesis
About
We introduce Programmatic Motion Concepts, a hierarchical motion representation for human actions that captures both low-level motion and high-level description as motion concepts. This representation enables human motion description, interactive editing, and controlled synthesis of novel video sequences within a single framework. We present an architecture that learns this concept representation from paired video and action sequences in a semi-supervised manner. The compactness of our representation also allows us to present a low-resource training recipe for data-efficient learning. By outperforming established baselines, especially in the small data regime, we demonstrate the efficiency and effectiveness of our framework for multiple applications.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Human Motion Recognition | MotiCon (test) | NormED0.0847 | 11 | |
| Human Motion Localization | MotiCon (test) | mAP45.43 | 7 | |
| Controlled motion synthesis | MotiCon (test) | APE0.1531 | 5 | |
| Video Synthesis | MotiCon (test) | PSNR19.355 | 5 | |
| Action-conditioned motion synthesis | MotiCon | FID2.406 | 4 | |
| Motion Synthesis | GolfDB | KD0.529 | 3 | |
| Motion Synthesis | MotiCon | KD22.9 | 3 |