
Reward Modeling on PersonalRewardBench (test)

65.21 Macro Accuracy

P-GenRM-8B

Feb 12, 2026
Updated 1mo ago

Evaluation Results

Date      Macro Accuracy
2026.02   65.21
2026.02   63.33
2026.02   61.51
2026.02   60.64
2026.02   58.27
2026.02   56.24