Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reward Modeling on IFBench (test)
Loading...
57.9
Accuracy
RM-Distiller
36.06
41.73
47.4
53.07
Jan 20, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
RM-Distiller
Student=Qwen2.5-3B-Ins...
2026.01
57.9
RM-Distiller
Student=Qwen2.5-3B-Ins...
2026.01
57.7
Qwen3-14B
2026.01
54.2
BT Classifier
Student=Qwen2.5-3B-Ins...
2026.01
52.5
GPT-4o
2026.01
50.8
BT Classifier
Student=Qwen2.5-3B-Ins...
2026.01
50.2
Qwen2.5-3B-Instruct
2026.01
36.9
Feedback
Search any
task
Search any
task