Fairness Evaluation on CEB Adult

Score: 68.3

Self-Debias Iter1

Updated 3 months ago

Evaluation Results

Method	Date	Score
Self-Debias Iter1	2026.04	68.3
Self-Debias Iter2 + Self-Correction	2026.04	68.1
Qwen2.5-7B-Instruct	2026.04	68
Self-Debias Offline	2026.04	67.5
Self-Debias Iter1 + Self-Correction	2026.04	67.2
Self-Debias Offline + Self-Correction	2026.04	67.1
Self-Debias Iter2	2026.04	67.1
Self-Debias SFT + Self-Correction	2026.04	66.9
Self-Debias SFT	2026.04	66.5
Qwen2.5-7B-Instruct + Self-Correction	2026.04	63.7
Qwen1.5-8B	2026.04	63.1
DeepSeek-R1-Distill-Qwen-7B	2026.04	50.3
DeepSeek-R1-Distill-Qwen-7B + Self-Correction	2026.04	49.2
Qwen1.5-8B + Self-Correction	2026.04	37.1
Llama-3.1-8B-Instruct	2026.04	21.6
Llama-3.1-8B-Instruct + Self-Correction	2026.04	6.9