Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints
About
Shape-from-Template (SfT) refers to the class of methods that reconstruct the 3D shape of a deforming object from images/videos using a 3D template. Traditional SfT methods require point correspondences between images and the texture of the 3D template in order to reconstruct 3D shapes from images/videos in real time. Their performance severely degrades when encountered with severe occlusions in the images because of the unavailability of correspondences. In contrast, modern SfT methods use a correspondence-free approach by incorporating deep neural networks to reconstruct 3D objects, thus requiring huge amounts of data for supervision. Recent advances use a fully unsupervised or self-supervised approach by combining differentiable physics and graphics to deform 3D template to match input images. In this paper, we propose an unsupervised SfT which uses only image observations: color features, gradients and silhouettes along with a mesh inextensibility constraint to reconstruct at a $400\times$ faster pace than (best-performing) unsupervised SfT. Moreover, when it comes to generating finer details and severe occlusions, our method outperforms the existing methodologies by a large margin. Code is available at https://github.com/dvttran/nsft.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Mesh Reconstruction | ϕ-SfT synthetic (S1) | Average Mesh Error0.0229 | 4 | |
| 3D Mesh Reconstruction | ϕ-SfT synthetic (S4) | Average Mesh Error0.0031 | 4 | |
| 3D Mesh Reconstruction | ϕ-SfT synthetic (S2) | Average Mesh Error0.0254 | 4 | |
| 3D Mesh Reconstruction | ϕ-SfT S3 synthetic | Average Mesh Error0.0357 | 4 | |
| Depth Map Reconstruction | Kinect Paper dataset | RMSE (mm)4.01 | 4 | |
| 3D Reconstruction | phi-SfT real (R1) | Average Chamfer Distance6.60e-5 | 3 | |
| 3D Reconstruction | phi-SfT real (R2) | Avg Chamfer Distance (x 10^4)1.3 | 3 | |
| 3D Reconstruction | phi-SfT real (R3) | Avg Chamfer Distance (x 10^4)4.49 | 3 | |
| 3D Reconstruction | phi-SfT real (R4) | Avg Chamfer Distance (x10^4)8.15 | 3 | |
| 3D Reconstruction | phi-SfT real (R5) | Avg Chamfer Distance (x 10^4)8.04 | 3 |