Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reward Modeling on RewardBench (test)

0.933RWBench

J1-Llama-70B

0.744760.793630.84250.89137Jan 28, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.933
2026.01
0.927
2026.01
0.926
2026.01
0.914
2026.01
0.912
2026.01
0.91
2026.01
0.905
2026.01
0.9
2026.01
0.894
2026.01
0.89
2026.01
0.882
2026.01
0.867
2026.01
0.86
2026.01
0.857
2026.01
0.854
2026.01
0.852
2026.01
0.822
2026.01
0.82
2026.01
0.814
2026.01
0.812
2026.01
0.81
2026.01
0.807
2026.01
0.773
2026.01
0.766
2026.01
0.752