Learning Parallel Dense Correspondence from Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction
About
This paper focuses on the task of 4D shape reconstruction from a sequence of point clouds. Despite the recent success achieved by extending deep implicit representations into 4D space, it is still a great challenge in two respects, i.e. how to design a flexible framework for learning robust spatio-temporal shape representations from 4D point clouds, and develop an efficient mechanism for capturing shape dynamics. In this work, we present a novel pipeline to learn a temporal evolution of the 3D human shape through spatially continuous transformation functions among cross-frame occupancy fields. The key idea is to parallelly establish the dense correspondence between predicted occupancy fields at different time steps via explicitly learning continuous displacement vector fields from robust spatio-temporal shape representations. Extensive comparisons against previous state-of-the-arts show the superior accuracy of our approach for 4D human reconstruction in the problems of 4D shape auto-encoding and completion, and a much faster network inference with about 8 times speedup demonstrates the significant efficiency of our approach. The trained models and implementation code are available at https://github.com/tangjiapeng/LPDC-Net.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Dynamic Human Body Modeling | D-FAUST 5 (Seen Individual) | Chamfer Distance0.055 | 18 | |
| Dynamic Human Body Modeling | D-FAUST 5 (Unseen Individual) | IoU7.62e+3 | 12 | |
| Flow Estimation | D-FAUST S1 Seen Individuals Unseen Motions (test) | Correspondence L2 Distance0.2091 | 8 | |
| 4D Shape Completion | D-FAUST (Unseen Motion) | IoU84.9 | 8 | |
| 4D Shape Completion | D-FAUST (unseen individual) | IoU76.2 | 8 | |
| 4D Shape Completion | DT4D-A (Unseen Motion) | IoU72.4 | 8 | |
| 4D Shape Completion | DT4D-A (Unseen Individual) | IoU59.4 | 8 | |
| 4D Reconstruction and Flow Estimation | D-FAUST (test) | Training Time (sec/iter)2.09 | 8 | |
| 4D Reconstruction | D-FAUST S1 Seen Individuals Unseen Motions (test) | Chamfer Distance (x10^-3)152.6 | 7 | |
| 4D Reconstruction | D-FAUST S2: Unseen Individuals, Seen Motions (test) | Chamfer Distance2.19e-4 | 7 |