Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Single-Shot Motion Completion with Transformer

About

Motion completion is a challenging and long-discussed problem, which is of great significance in film and game applications. For different motion completion scenarios (in-betweening, in-filling, and blending), most previous methods deal with the completion problems with case-by-case designs. In this work, we propose a simple but effective method to solve multiple motion completion problems under a unified framework and achieves a new state of the art accuracy under multiple evaluation settings. Inspired by the recent great success of attention-based models, we consider the completion as a sequence to sequence prediction problem. Our method consists of two modules - a standard transformer encoder with self-attention that learns long-range dependencies of input motions, and a trainable mixture embedding module that models temporal information and discriminates key-frames. Our method can run in a non-autoregressive manner and predict multiple missing frames within a single forward propagation in real time. We finally show the effectiveness of our method in music-dance applications.

Yinglin Duan, Tianyang Shi, Zhengxia Zou, Yenan Lin, Zhehui Qian, Bohan Zhang, Yi Yuan (1) __INSTITUTION_7__ NetEase Fuxi AI Lab, (2) University of Michigan, (3) NetEase)• 2021

Related benchmarks

TaskDatasetResultRank
Motion In-betweeningLaFAN1 (test)
L2Q0.14
77
Motion In-fillingAnidance (test)
L2P0.84
27
Dance blendingNew dance dataset (test)
L2Q1.62
12
Motion In-betweeningLaFAN1 Subject 5 (test)--
12
Showing 4 of 4 rows

Other info

Code

Follow for update