BendVLM: Test-Time Debiasing of Vision-Language Embeddings

About

Vision-language model (VLM) embeddings have been shown to encode biases present in their training data, such as societal biases that prescribe negative characteristics to members of various racial and gender identities. VLMs are being quickly adopted for a variety of tasks ranging from few-shot classification to text-guided image generation, making debiasing VLM embeddings crucial. Debiasing approaches that fine-tune the VLM often suffer from catastrophic forgetting. On the other hand, fine-tuning-free methods typically utilize a "one-size-fits-all" approach that assumes that correlation with the spurious attribute can be explained using a single linear direction across all possible inputs. In this work, we propose Bend-VLM, a nonlinear, fine-tuning-free approach for VLM embedding debiasing that tailors the debiasing operation to each unique input. This allows for a more flexible debiasing approach. Additionally, we do not require knowledge of the set of inputs a priori to inference time, making our method more appropriate for online, open-set tasks such as retrieval and text guided image generation.

Walter Gerych, Haoran Zhang, Kimia Hamidieh, Eileen Pan, Maanas Sharma, Thomas Hartvigsen, Marzyeh Ghassemi• 2024

Related benchmarks

Task	Dataset	Result
Social Bias Evaluation	FairFace	MS0.081	54
Bias Mitigation for Stereotype Queries	UTKFACE Gender	KL Divergence0.004	33
Bias Mitigation for Stereotype Queries	UTKFACE Race	KL Divergence0.041	33
Image Retrieval	CelebA Hair Color queries	KL Divergence0.028	24
Image Retrieval	CelebA Stereotype queries	KL Divergence0.03	24
Classification	CelebA Gender (test)	Accuracy83.6	24
Zero-shot classification fairness	CelebA Gender	Accuracy80.9	24
Classification	Waterbirds Background (test)	Accuracy81.9	24
Zero-shot classification fairness	Waterbirds Background	Accuracy (Zero-shot)82.6	24
Fair Image Retrieval	CelebA (test)	KL Divergence0.011	9

Showing 10 of 14 rows

Other info

Code

Follow for update

@wizwand_team Discord