
BiasEdit: A Training-Free Bias-Detect-and-Edit Framework for Learning Fair Visual Classifiers

About

Visual data from the Web power image classifiers, which underpin many web services, such as recommendation and content moderation. However, raw Web data often contain spurious correlations and social biases, and neural networks are known for their tendency to learn biases present in data. This can reinforce unfairness in web services and in the web data itself, leading to a vicious cycle. In the context of image classification, networks learn a bias attribute for a specific class when a majority of that class's images contain the attribute and images of other classes do not. Hence, training a fair and debiased classifier from a biased dataset demands handling the imbalance between a majority of images with bias attributes (bias-aligned samples) and a minority without (bias-conflict samples). In this work, we introduce BiasEdit, a modular framework that automatically detects bias attributes from the original dataset and edits them to construct a debiased dataset. Specifically, BiasEdit first detects unknown bias attributes via statistical dependence and mutual information analysis of visual-linguistic representations, and then explicitly edits those attributes using text-guided image editing to generate realistic bias-conflict samples. Unlike prior works that assume known bias attributes or rely on synthetic mixing, our method operates without manual annotations and can leverage off-the-shelf vision-language and editing models. BiasEdit addresses a fundamental challenge in Web-sourced visual AI, mitigating dataset-induced bias and achieving state-of-the-art debiasing performance even when training data are fully biased.
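To make the detection idea concrete, the following is a minimal sketch of scoring candidate attributes by their empirical mutual information with the class label. It is not the authors' implementation: it assumes each image has already been reduced to binary attribute tags (for example, from captions), and the function names `mutual_information` and `flag_bias_attributes` and the 0.5-bit threshold are hypothetical choices made for illustration.

```python
import math
from collections import Counter


def mutual_information(labels, attrs):
    """Empirical mutual information I(Y; A) in bits for two discrete sequences."""
    if len(labels) != len(attrs) or not labels:
        raise ValueError("labels and attrs must be non-empty and equally long")
    n = len(labels)
    joint = Counter(zip(labels, attrs))   # counts of (class, attribute) pairs
    count_y = Counter(labels)             # marginal counts of classes
    count_a = Counter(attrs)              # marginal counts of attribute values
    mi = 0.0
    for (y, a), c in joint.items():
        # p(y, a) * log2( p(y, a) / (p(y) * p(a)) )
        mi += (c / n) * math.log2(c * n / (count_y[y] * count_a[a]))
    return mi


def flag_bias_attributes(labels, attr_table, threshold=0.5):
    """Return names of attributes whose MI with the label exceeds `threshold`.

    `attr_table` maps an attribute name to its per-image values.
    The threshold is an illustrative assumption, not a value from the paper.
    """
    return [
        name
        for name, values in attr_table.items()
        if mutual_information(labels, values) > threshold
    ]


# Toy data: the "water" tag tracks the class perfectly, "blurry" is independent.
labels = ["A"] * 4 + ["B"] * 4
attr_table = {
    "water": [1, 1, 1, 1, 0, 0, 0, 0],
    "blurry": [1, 0, 1, 0, 1, 0, 1, 0],
}
flagged = flag_bias_attributes(labels, attr_table)
```

In this toy example the attribute that co-occurs with only one class carries one full bit of information about the label, while the independent attribute carries none, so only the former would be a candidate for editing.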

Jungwook Seo, Yoonsik Park, Changmin Lee, Sungyong Baik • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Image Classification | Waterbirds (test) | -- | 127 |
| Image Classification | ColoredMNIST (test) | Average Accuracy: 0.9395 | 27 |
| Image Classification | BFFHQ (test) | Accuracy (BC): 71.33 | 18 |
| Image Classification | Dogs & Cats (test) | Accuracy (BC): 69.04 | 18 |
| Image Classification | Colored MNIST 1.0% bias-conflict ratio | BC Score: 92.73 | 9 |
| Image Classification | Colored MNIST 5.0% bias-conflict ratio | BC Score: 91.72 | 9 |
| Image Classification | BFFHQ 5.0% bias-conflict ratio | BC Score: 82.73 | 9 |
| Image Classification | Dogs & Cats 1.0% bias-conflict ratio | BC: 63.04 | 9 |
| Image Classification | Dogs & Cats 5.0% bias-conflict ratio | BC Score: 77.7 | 9 |
| Image Classification | Waterbirds 1.0% bias-conflict ratio | Bias Conflict (BC): 59.79 | 9 |
Showing 10 of 14 rows
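Several of the results above are reported on bias-conflict (BC) samples. As a rough illustration of what such a metric measures, the sketch below computes accuracy restricted to samples whose attribute disagrees with the one their class is usually paired with. This is an assumed reading of the metric, not the benchmark's official evaluation code; `bias_conflict_accuracy` and the `aligned_attr` mapping are hypothetical names.

```python
def bias_conflict_accuracy(preds, labels, attrs, aligned_attr):
    """Accuracy over bias-conflict samples only.

    `aligned_attr` maps each class to the attribute value it usually
    co-occurs with; a sample is bias-conflict when its attribute differs.
    """
    conflict = [
        (p, y)
        for p, y, a in zip(preds, labels, attrs)
        if a != aligned_attr[y]
    ]
    if not conflict:
        raise ValueError("no bias-conflict samples in the evaluation set")
    return sum(p == y for p, y in conflict) / len(conflict)


# Toy data: class 0 is usually red, class 1 is usually green.
aligned_attr = {0: "red", 1: "green"}
labels = [0, 0, 1, 1, 0, 1]
attrs = ["red", "red", "green", "green", "green", "red"]
preds = [0, 0, 1, 1, 1, 1]  # the model follows color on the fifth sample
bc_acc = bias_conflict_accuracy(preds, labels, attrs, aligned_attr)
```

Here only the last two samples are bias-conflict, and the model gets one of them right, so the score is lower than the overall accuracy on the same predictions.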
