Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reward Modeling on PersonalRewardBench (test)

65.21Macro Accuracy

P-GenRM-8B

55.881258.303160.72563.1469Feb 12, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
65.21
2026.02
63.33
2026.02
61.51
2026.02
60.64
2026.02
58.27
2026.02
56.24