
Reward Modeling on IFBench (test)

Accuracy: 57.9

RM-Distiller

[Chart: Accuracy over time; y-axis ticks 36.06, 41.73, 47.4, 53.07; x-axis Jan 20, 2026]
Updated 4d ago

Evaluation Results

Method  Links  Date     Accuracy
               2026.01  57.9
               2026.01  57.7
               2026.01  54.2
               2026.01  52.5
               2026.01  50.8
               2026.01  50.2
                        36.9