Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FairViT: Fair Vision Transformer via Adaptive Masking

About

Vision Transformer (ViT) has achieved excellent performance and demonstrated its promising potential in various computer vision tasks. The wide deployment of ViT in real-world tasks requires a thorough understanding of the societal impact of the model. However, most ViT-based works do not take fairness into account and it is unclear whether directly applying CNN-oriented debiased algorithm to ViT is feasible. Moreover, previous works typically sacrifice accuracy for fairness. Therefore, we aim to develop an algorithm that improves accuracy without sacrificing fairness. In this paper, we propose FairViT, a novel accurate and fair ViT framework. To this end, we introduce a novel distance loss and deploy adaptive fairness-aware masks on attention layers updating with model parameters. Experimental results show \sys can achieve accuracy better than other alternatives, even with competitive computational efficiency. Furthermore, \sys achieves appreciable fairness results.

Bowei Tian, Ruijie Du, Yanning Shen• 2024

Related benchmarks

TaskDatasetResultRank
ClassificationCelebA
Avg Accuracy89.3
197
Facial Attribute ClassificationCelebA
Accuracy89.3
163
ClassificationCelebA (test)
Average Accuracy83.8
92
Image ClassificationCelebA (test)
Accuracy99
82
Gender ClassificationUTKFace (test)
Gender Accuracy97.5
14
Image-Based ClassificationCelebA
Accuracy92.7
10
Big Nose Classification (Sensitive Attribute: Gender (Male))CelebA
Accuracy81.9
10
Gender ClassificationUTKFace
Accuracy97.5
9
Attribute ClassificationCelebA T=s, S=m
Accuracy94.27
7
Attribute ClassificationCelebA T=a, S=br
Accuracy82.52
7
Showing 10 of 22 rows

Other info

Follow for update