Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reward Modeling on RM-Bench Easy

92.2Accuracy

Llama-3.1-Nemotron-70B

69.63275.49181.3587.209Feb 9, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
92.2
2026.02
92.1
2026.02
89.8
2026.02
88.9
83.9
2026.02
83.5
2026.02
82
2026.02
80.4
2026.02
79.4
2026.02
70.5