Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reward Modeling on EvalBiasBench

0.75Accuracy

Qwen3-14B

0.2820.40350.5250.6465Jan 20, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.75
2026.01
0.7
2026.01
0.638
2026.01
0.575
2026.01
0.475
2026.01
0.444
2026.01
0.3