Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Robustness Evaluation on BiasBench
Loading...
82.5
Accuracy
Qwen2.5-32B-Instruct
64.3
69.025
73.75
78.475
Jan 7, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen2.5-32B-Instruct
variant=Instruct
2026.01
82.5
DeepSeek-V3
2026.01
81.25
Qwen3-30B-A3B-Instruct-2507
variant=Instruct
2026.01
81.25
Qwen3-Next-80B-A3B-Instruct
variant=Instruct
2026.01
80
Qwen3-30B-A3B-Thinking-2507
variant=Thinking
2026.01
77.5
Qwen3-Next-80B-A3B-Thinking
variant=Thinking
2026.01
75
QwQ-32B
2026.01
67.5
DeepSeek-R1
2026.01
65
Feedback
Search any
task
Search any
task