Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction
About
This paper considers a new problem of adapting a pre-trained model of human mesh reconstruction to out-of-domain streaming videos. However, most previous methods based on the parametric SMPL model \cite{loper2015smpl} underperform in new domains with unexpected, domain-specific attributes, such as camera parameters, lengths of bones, backgrounds, and occlusions. Our general idea is to dynamically fine-tune the source model on test video streams with additional temporal constraints, such that it can mitigate the domain gaps without over-fitting the 2D information of individual test frames. A subsequent challenge is how to avoid conflicts between the 2D and temporal constraints. We propose to tackle this problem using a new training algorithm named Bilevel Online Adaptation (BOA), which divides the optimization process of overall multi-objective into two steps of weight probe and weight update in a training iteration. We demonstrate that BOA leads to state-of-the-art results on two human mesh reconstruction benchmarks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Pose Estimation | MPI-INF-3DHP (test) | PCK90.3 | 559 | |
| 3D Human Pose Estimation | 3DPW (test) | PA-MPJPE49.5 | 505 | |
| 3D Human Mesh Recovery | 3DPW (test) | PA-MPJPE46.2 | 264 | |
| 3D Human Pose Estimation | 3DPW | PA-MPJPE76.2 | 119 | |
| 3D Human Pose and Shape Estimation | 3DPW | PA-MPJPE49.5 | 74 | |
| 3D Human Mesh Estimation | 3DPW (test) | PA-MPJPE61.7 | 44 | |
| Human Mesh Reconstruction | MPI-INF-3DHP (test) | MPJPE117.6 | 36 | |
| 3D Human Pose Estimation | 3DPW cross-dataset (test) | PA-MPJPE49.5 | 27 | |
| 3D Pose Estimation | 3DHP | MPJPE135.3 | 25 | |
| Human Mesh Reconstruction | 3DPW (test) | PA-MPJPE49.5 | 18 |