Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OFER: Occluded Face Expression Reconstruction

About

Reconstructing 3D face models from a single image is an inherently ill-posed problem, which becomes even more challenging in the presence of occlusions. In addition to fewer available observations, occlusions introduce an extra source of ambiguity where multiple reconstructions can be equally valid. Despite the ubiquity of the problem, very few methods address its multi-hypothesis nature. In this paper we introduce OFER, a novel approach for single-image 3D face reconstruction that can generate plausible, diverse, and expressive 3D faces, even under strong occlusions. Specifically, we train two diffusion models to generate the shape and expression coefficients of a face parametric model, conditioned on the input image. This approach captures the multi-modal nature of the problem, generating a distribution of solutions as output. However, to maintain consistency across diverse expressions, the challenge is to select the best matching shape. To achieve this, we propose a novel ranking mechanism that sorts the outputs of the shape diffusion network based on predicted shape accuracy scores. We evaluate our method using standard benchmarks and introduce CO-545, a new protocol and dataset designed to assess the accuracy of expressive faces under occlusion. Our results show improved performance over occlusion-based methods, while also enabling the generation of diverse expressions for a given image.

Pratheba Selvaraju, Victoria Fernandez Abrevaya, Timo Bolkart, Rick Akkerman, Tianyu Ding, Faezeh Amjadi, Ilya Zharkov• 2024

Related benchmarks

TaskDatasetResultRank
Neutral Face ReconstructionNoW full (val)
Median Error0.81
12
3D Metrical ReconstructionNoW (test)
Median Error (mm)1.27
10
Neutral Face ReconstructionNoW unoccluded (val)
Median Error0.81
8
Neutral Face ReconstructionNoW occluded (val)
Median Error0.84
8
Expression ReconstructionCO-545
CSE0.17
2
Expression ReconstructionDey unoccluded
Neutral RMSE1.9
2
Expression ReconstructionCO-545 unoccluded
RMSE (Neutral)1.95
2
Expression ReconstructionCO-545 occluded
Neutral RMSE3.48
2
Expression ReconstructionDataset mask
STD-S34.04
2
Expression ReconstructionDataset sunglasses
STD-S34.38
2
Showing 10 of 11 rows

Other info

Follow for update