BiasEdit: A Training-Free Bias-Detect-and-Edit Framework for Learning Fair Visual Classifiers

About

Visual data from the Web power image classifiers, which underpin many web services such as recommendation and content moderation. However, raw Web data often contain spurious correlations and social biases, and neural networks are known for their tendency to learn biases present in data. This can reinforce unfairness in both web services and web data, leading to a vicious cycle. In the context of image classification, a network learns a bias attribute for a specific class when a majority of that class's images, and only that class's images, contain the attribute. Hence, training a fair and debiased classifier from a biased dataset requires handling the imbalance between a majority of images with bias attributes (bias-aligned samples) and a minority without (bias-conflict samples). In this work, we introduce BiasEdit, a modular framework that automatically detects bias attributes in the original dataset and edits them to construct a debiased dataset. Specifically, BiasEdit first detects unknown bias attributes via statistical dependence and mutual information analysis of visual-linguistic representations, and then explicitly edits those attributes using text-guided image editing to generate realistic bias-conflict samples. Unlike prior works that assume known bias attributes or rely on synthetic mixing, our method operates without manual annotations and can leverage off-the-shelf vision-language and editing models. BiasEdit addresses a fundamental challenge in Web-sourced visual AI, mitigating dataset-induced bias and achieving state-of-the-art debiasing performance even when training data are fully biased.
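To make the detection idea above more concrete, here is a minimal sketch of scoring candidate attributes by their empirical mutual information with the class label. It is an illustration under stated assumptions, not the authors' implementation: the binary attribute indicators are assumed to have been produced already by some vision-language model, and the names `mutual_information`, `rank_candidate_attributes`, and the example attributes are hypothetical.

```python
import math
from collections import Counter


def mutual_information(attrs, labels):
    """Empirical mutual information I(A; Y) in bits between two discrete sequences."""
    n = len(attrs)
    if n == 0 or n != len(labels):
        raise ValueError("attrs and labels must be non-empty and of equal length")
    joint = Counter(zip(attrs, labels))  # counts of (attribute value, label) pairs
    count_a = Counter(attrs)             # marginal counts of attribute values
    count_y = Counter(labels)            # marginal counts of labels
    mi = 0.0
    for (a, y), c in joint.items():
        # p(a, y) / (p(a) * p(y)) simplifies to c * n / (count_a[a] * count_y[y])
        mi += (c / n) * math.log2(c * n / (count_a[a] * count_y[y]))
    return mi


def rank_candidate_attributes(attribute_table, labels):
    """Sort candidate attributes by mutual information with the label, highest first.

    attribute_table maps a (hypothetical) attribute name to a per-image list of
    0/1 indicators. How these indicators are obtained is an assumption here.
    """
    scores = {
        name: mutual_information(values, labels)
        for name, values in attribute_table.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Toy data: a background attribute that tracks the class perfectly,
    # and a second attribute that is independent of the class.
    labels = ["waterbird"] * 4 + ["landbird"] * 4
    attribute_table = {
        "water_background": [1, 1, 1, 1, 0, 0, 0, 0],
        "blurry": [1, 0, 1, 0, 1, 0, 1, 0],
    }
    for name, score in rank_candidate_attributes(attribute_table, labels):
        print(f"{name}: {score:.3f} bits")
```

In this toy setup, an attribute that co-occurs with only one class gets a high score and would be a candidate bias attribute, while an attribute spread evenly across classes scores near zero. A real pipeline would also need a threshold or a significance test to decide which attributes to edit, which this sketch leaves out.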

Jungwook Seo, Yoonsik Park, Changmin Lee, Sungyong Baik • 2026

Related benchmarks

Task	Dataset	Result
Image Classification	Waterbirds (test)	--	214
Image Classification	ColoredMNIST (test)	Average Accuracy 0.9395	27
Image Classification	BFFHQ (test)	Accuracy (BC) 71.33	18
Image Classification	Dogs & Cats (test)	Accuracy (BC) 69.04	18
Image Classification	Colored MNIST 1.0% bias-conflict ratio	BC Score 92.73	9
Image Classification	Colored MNIST 5.0% bias-conflict ratio	BC Score 91.72	9
Image Classification	BFFHQ 5.0% bias-conflict ratio	BC Score 82.73	9
Image Classification	Dogs & Cats 1.0% bias-conflict ratio	BC 63.04	9
Image Classification	Dogs & Cats 5.0% bias-conflict ratio	BC Score 77.7	9
Image Classification	Waterbirds 1.0% bias-conflict ratio	Bias Conflict (BC) 59.79	9

Showing 10 of 14 rows
