Bias Evaluation

Benchmarks

Dataset Name	SOTA Method	Metric	Value	Count	Last Updated
BBQ	C2PO	Accuracy	99.3	175	18d ago
Task 2 Persona F		BAD Score	0.3	25	3mo ago
Task 2 Persona E		BAD Score	0.017	25	3mo ago
Task 2 Persona D		BAD Score	0.007	25	3mo ago
Task 2 Persona C		BAD Score	-0.18	25	3mo ago
Task 2 Persona B		BAD Score	-0.007	25	3mo ago
Task 2 Persona A		BAD Score	-0.006	25	3mo ago
Race & SES	iPASwo	Mean Improvement	17	18	2mo ago
KoBBQ		Ambiguous Context Score	98.3	17	4mo ago
BBQ averaged across gender, nationality, and religion domains	Self-Debiasing	Accuracy (Ambiguous)	87.73	16	4mo ago
SOCT	Llama 3.1 8B - LFT w. SH-N (baseline 3)	DR (Female Stereotype)	0.054	15	4mo ago
Crows-pairs	Qwen 3 0.6B - LFT w. SH (baseline 2)	Pct Stereotype	51.25	15	4mo ago
Honest	Llama 3.1 8B - LFT w. SH-Dgender (BaseCDA)	Honest Score	11.7	15	4mo ago
Reddit Bias	Llama 3.1 8B - Pretrained model (baseline 1)	t-value	-4.7523	15	4mo ago
Male-biased prompts	Manually curated	Male Bias (Base)	0.53	14	4mo ago
CrowS-Pairs	CDA	CS Score	50.01	13	3mo ago
HolisticBias	PaCE	GN Score	66.2	10	4mo ago
SexualOrientation	iPASwo	Mean Improvement	0.1	9	2mo ago
Religion	iPASwo	Mean Improvement	0.11	9	2mo ago
Race & Gender	iPASwo	Mean Improvement	20	9	2mo ago
RaceEthnicity	iPASa	Mean Improvement	15	9	2mo ago
PhysicalAppearance	iPASa	Mean Improvement	0.07	9	2mo ago
Nationality	iPASwo	Mean Improvement	0.12	9	2mo ago
GenderIdentity	iPASwo	Mean Improvement	0.12	9	2mo ago
DisabilityStatus	iPASwo	Mean Improvement	0.17	9	2mo ago

Showing 25 of 51 rows