Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection

About

Recently, deep learning-based facial landmark detection has achieved significant improvement. However, the semantic ambiguity problem degrades detection performance. Specifically, the semantic ambiguity causes inconsistent annotation and negatively affects the model's convergence, leading to worse accuracy and instability prediction. To solve this problem, we propose a Self-adapTive Ambiguity Reduction (STAR) loss by exploiting the properties of semantic ambiguity. We find that semantic ambiguity results in the anisotropic predicted distribution, which inspires us to use predicted distribution to represent semantic ambiguity. Based on this, we design the STAR loss that measures the anisotropism of the predicted distribution. Compared with the standard regression loss, STAR loss is encouraged to be small when the predicted distribution is anisotropic and thus adaptively mitigates the impact of semantic ambiguity. Moreover, we propose two kinds of eigenvalue restriction methods that could avoid both distribution's abnormal change and the model's premature convergence. Finally, the comprehensive experiments demonstrate that STAR loss outperforms the state-of-the-art methods on three benchmarks, i.e., COFW, 300W, and WFLW, with negligible computation overhead. Code is at https://github.com/ZhenglinZhou/STAR.

Zhenglin Zhou, Huaxia Li, Hong Liu, Nanyang Wang, Gang Yu, Rongrong Ji• 2023

Related benchmarks

TaskDatasetResultRank
Facial Landmark Detection300-W (Common)
NME2.52
180
Facial Landmark Detection300-W (Fullset)
Mean Error (%)2.87
174
Facial Landmark Detection300W (Challenging)
NME4.32
159
Facial Landmark DetectionWFLW (test)
Mean Error (ME) - All4.02
122
Facial Landmark DetectionCOFW (test)
NME0.0321
93
Face Alignment300W (Challenging)
NME4.32
93
Face Alignment300W Common
NME2.52
90
Face Alignment300-W (Full)
NME2.87
66
Facial Landmark DetectionWFLW (Full)
NME (%)4.02
65
Facial Landmark Detection300W
NME2.87
52
Showing 10 of 22 rows

Other info

Code

Follow for update