
TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation

About

Human-human motion generation is essential for understanding humans as social beings. Current methods fall into two main categories: single-person-based methods and separate modeling-based methods. To delve into this field, we abstract the overall generation process into a general framework, MetaMotion, which consists of two phases: temporal modeling and interaction mixing. For temporal modeling, the single-person-based methods directly concatenate the two people into a single one, while the separate modeling-based methods skip the modeling of interaction sequences. This inadequate modeling results in sub-optimal performance and redundant model parameters. In this paper, we introduce TIMotion (Temporal and Interactive Modeling), an efficient and effective framework for human-human motion generation. Specifically, we first propose Causal Interactive Injection, which models the two separate sequences as a single causal sequence by leveraging their temporal and causal properties. We then present Role-Evolving Scanning to adapt to changes in the active and passive roles throughout the interaction. Finally, to generate smoother and more rational motion, we design Localized Pattern Amplification to capture short-term motion patterns. Extensive experiments on InterHuman and InterX demonstrate that our method achieves superior performance. Project page: https://aigc-explorer.github.io/TIMotion-page/
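
As a rough illustration of what "modeling two separate sequences as a causal sequence" could look like, the sketch below interleaves two per-frame motion sequences into one time-ordered sequence. The abstract does not specify the layout, so the ordering (a_1, b_1, a_2, b_2, ...), the `first` parameter (a loose nod to the idea that either person may take the leading role), and the function names `interleave_causal` and `split_causal` are all assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Sequence, Tuple

Frame = Sequence[float]  # one person's pose features at a single time step


def interleave_causal(
    person_a: Sequence[Frame],
    person_b: Sequence[Frame],
    first: str = "a",
) -> List[Frame]:
    """Merge two equal-length motion sequences into one time-ordered sequence.

    Assumed layout (hypothetical): frames alternate between the two people,
    so step t of both people sits before step t+1 of either person.
    `first` selects which person leads within each time step.
    """
    if len(person_a) != len(person_b):
        raise ValueError("both sequences must cover the same number of frames")
    if first not in ("a", "b"):
        raise ValueError("first must be 'a' or 'b'")

    lead, follow = (person_a, person_b) if first == "a" else (person_b, person_a)
    merged: List[Frame] = []
    for lead_frame, follow_frame in zip(lead, follow):
        merged.append(lead_frame)
        merged.append(follow_frame)
    return merged


def split_causal(merged: Sequence[Frame]) -> Tuple[List[Frame], List[Frame]]:
    """Undo interleave_causal: return (leading person, following person)."""
    if len(merged) % 2 != 0:
        raise ValueError("merged sequence must contain an even number of frames")
    return list(merged[0::2]), list(merged[1::2])
```

A sequence model with a causal mask applied to the merged sequence would then see each person's frame in the context of both people's earlier frames; whether TIMotion does exactly this is not stated in the abstract.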

Yabiao Wang, Shuo Wang, Jiangning Zhang, Ke Fan, Jiafu Wu, Zhucun Xue, Yong Liu • 2024

Related benchmarks

Task | Dataset | Metric | Result | Rank
Interactive Motion Synthesis | InterHuman (test) | R Precision (Top 1) | 49.1 | 37
text-conditioned human interaction generation | InterHuman (test) | R Precision (Top 1) | 48.5 | 27
Human-Human Motion Generation | InterX (test) | R Precision Top 1 | 41.4 | 12
Human-Human Motion Generation | InterHuman (test) | Top-1 R Precision | 50.1 | 11
Human-Human Interaction generation | InterX SMPLX-based (test) | R-Prec@1 | 0.412 | 8
3D Multi-human Motion Editing | InterGen (test) | FID | 0.4451 | 5
Human Interaction Generation | InterHuman and InterX (test) | PV | 48.5 | 5
Text-conditioned Human-Human Interaction Generation | InterHuman and InterX | AITS | 1.472 | 3
motion in-betweening editing | InterHuman (test) | R Precision Top 1 | 51.6 | 2