Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Practical Face Reconstruction via Differentiable Ray Tracing

About

We present a differentiable ray-tracing based novel face reconstruction approach where scene attributes - 3D geometry, reflectance (diffuse, specular and roughness), pose, camera parameters, and scene illumination - are estimated from unconstrained monocular images. The proposed method models scene illumination via a novel, parameterized virtual light stage, which in-conjunction with differentiable ray-tracing, introduces a coarse-to-fine optimization formulation for face reconstruction. Our method can not only handle unconstrained illumination and self-shadows conditions, but also estimates diffuse and specular albedos. To estimate the face attributes consistently and with practical semantics, a two-stage optimization strategy systematically uses a subset of parametric attributes, where subsequent attribute estimations factor those previously estimated. For example, self-shadows estimated during the first stage, later prevent its baking into the personalized diffuse and specular albedos in the second stage. We show the efficacy of our approach in several real-world scenarios, where face attributes can be estimated even under extreme illumination conditions. Ablation studies, analyses and comparisons against several recent state-of-the-art methods show improved accuracy and versatility of our approach. With consistent face attributes reconstruction, our method leads to several style -- illumination, albedo, self-shadow -- edit and transfer applications, as discussed in the paper.

Abdallah Dib, Gaurav Bharaj, Junghyun Ahn, C\'edric Th\'ebault, Philippe-Henri Gosselin, Marco Romeo, Louis Chevallier• 2021

Related benchmarks

TaskDatasetResultRank
3D Face ReconstructionVoxceleb2 single images Source
PSNR33.41
6
3D Face ReconstructionVoxceleb2 video sequences (Source)
PSNR30.65
6
Face ReconstructionImages with Diverse Shadows 100 images
PSNR32.1
6
3D Face SynthesisVoxceleb 2 (video sequences (Target))
PSNR24.32
6
3D Face SynthesisVoxceleb2 single images Target
PSNR23.74
6
Texture RecoveryImages with Diverse Shadows 100 images
PSNR21.83
6
3D Face Reconstruction3DFAW, AFLW2000, and wikihuman (23 images) (test)
Position Error Mean (cm)0.174
5
Diffuse Albedo ReconstructionMaya renders
SSIM72.2
5
Specular Albedo ReconstructionMaya renders
SSIM54.7
5
Final Image ReconstructionMaya renders
SSIM0.965
3
Showing 10 of 10 rows

Other info

Follow for update