Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Compositional Representation for 4D Captures with Neural ODE

About

Learning based representation has become the key to the success of many computer vision systems. While many 3D representations have been proposed, it is still an unaddressed problem how to represent a dynamically changing 3D object. In this paper, we introduce a compositional representation for 4D captures, i.e. a deforming 3D object over a temporal span, that disentangles shape, initial state, and motion respectively. Each component is represented by a latent code via a trained encoder. To model the motion, a neural Ordinary Differential Equation (ODE) is trained to update the initial state conditioned on the learned motion code, and a decoder takes the shape code and the updated state code to reconstruct the 3D model at each time stamp. To this end, we propose an Identity Exchange Training (IET) strategy to encourage the network to learn effectively decoupling each component. Extensive experiments demonstrate that the proposed method outperforms existing state-of-the-art deep learning based methods on 4D reconstruction, and significantly improves on various tasks, including motion transfer and completion.

Boyan Jiang, Yinda Zhang, Xingkui Wei, Xiangyang Xue, Yanwei Fu• 2021

Related benchmarks

TaskDatasetResultRank
Dynamic Human Body ModelingD-FAUST 5 (Seen Individual)
Chamfer Distance0.068
18
Dynamic Human Body ModelingD-FAUST 5 (Unseen Individual)
IoU6.99e+3
12
4D ReconstructionD-FAUST S1 Seen Individuals Unseen Motions (test)
Chamfer Distance (x10^-3)166.7
7
4D ReconstructionD-FAUST S2: Unseen Individuals, Seen Motions (test)
Chamfer Distance2.22e-4
7
Motion RetargetingCAPE (test)
PA-MPJPE52.2
5
Shape and Motion RecoveryCAPE (test)
PA-MPJPE49.8
5
Future PredictionCAPE
PA-MPJPE91.9
4
4D ReconstructionCAPE (test)
IoU62.9
3
Future PredictionCAPE (test)
IoU64
3
Motion CompletionCAPE (test)
IoU76.6
3
Showing 10 of 11 rows

Other info

Follow for update