Single-Shot Motion Completion with Transformer

About

Motion completion is a challenging and long-discussed problem, which is of great significance in film and game applications. For different motion completion scenarios (in-betweening, in-filling, and blending), most previous methods deal with the completion problems with case-by-case designs. In this work, we propose a simple but effective method to solve multiple motion completion problems under a unified framework and achieves a new state of the art accuracy under multiple evaluation settings. Inspired by the recent great success of attention-based models, we consider the completion as a sequence to sequence prediction problem. Our method consists of two modules - a standard transformer encoder with self-attention that learns long-range dependencies of input motions, and a trainable mixture embedding module that models temporal information and discriminates key-frames. Our method can run in a non-autoregressive manner and predict multiple missing frames within a single forward propagation in real time. We finally show the effectiveness of our method in music-dance applications.

Yinglin Duan, Tianyang Shi, Zhengxia Zou, Yenan Lin, Zhehui Qian, Bohan Zhang, Yi Yuan (1) __INSTITUTION_7__ NetEase Fuxi AI Lab, (2) University of Michigan, (3) NetEase)• 2021

Related benchmarks

Task	Dataset	Result
Motion In-betweening	LaFAN1 (test)	L2Q0.14	77
Motion In-filling	Anidance (test)	L2P0.84	27
Dance blending	New dance dataset (test)	L2Q1.62	12
Motion In-betweening	LaFAN1 Subject 5 (test)	--	12

Showing 4 of 4 rows

Other info

Code

Follow for update

@wizwand_team Discord