Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reward Modeling on EvalBiasBench
Loading...
0.75
Accuracy
Qwen3-14B
0.282
0.4035
0.525
0.6465
Jan 20, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen3-14B
Role=Student
2026.01
0.75
GPT-4o
Role=Student
2026.01
0.7
RM-Distiller
Student=Qwen2.5-3B-Ins...
2026.01
0.638
RM-Distiller
Student=Qwen2.5-3B-Ins...
2026.01
0.575
BT Classifier
Student=Qwen2.5-3B-Ins...
2026.01
0.475
Qwen2.5-3B-Instruct
Role=Student
2026.01
0.444
BT Classifier
Student=Qwen2.5-3B-Ins...
2026.01
0.3
Feedback
Search any
task
Search any
task