Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reward Modeling on RM-Bench Normal

80Accuracy

INF-ORM-Llama3.1-70B

71.1673.45575.7578.045Feb 9, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
80
2026.02
78.4
2026.02
77
2026.02
76.6
2026.02
76.5
2026.02
74.2
2026.02
74.2
73.2
2026.02
71.9
2026.02
71.5