Reward Modeling on RM-Bench Easy

92.2 Accuracy

Llama-3.1-Nemotron-70B

[Chart: accuracy axis ticks 69.632, 75.491, 81.35, 87.209; date axis Feb 9, 2026]
Updated 4d ago

Evaluation Results

Method  Links
2026.02  92.2
2026.02  92.1
2026.02  89.8
2026.02  88.9
         83.9
2026.02  83.5
2026.02  82
2026.02  80.4
2026.02  79.4
2026.02  70.5