Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reward Modeling on RM-Bench Normal
Loading...
80
Accuracy
INF-ORM-Llama3.1-70B
71.16
73.455
75.75
78.045
Feb 9, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
INF-ORM-Llama3.1-70B
Backbone=Llama3.1-70B
2026.02
80
WILDREWARD-8B
Size=8B
2026.02
78.4
WILDREWARD-4B
Size=4B
2026.02
77
Athene-RM-8B
Size=8B
2026.02
76.6
Llama-3.1-Nemotron-70B
Size=70B
2026.02
76.5
Skywork-Reward-Llama-3.1-8B-v0.2
Backbone=Llama-3.1-8B
2026.02
74.2
Internlm2-20b-reward
Size=20b
2026.02
74.2
Llama-3-OffsetBias-RM-8B
Size=8B
2026.02
73.2
Skywork-Reward-Gemma-2-27B-v0.2
Backbone=Gemma-2-27B
2026.02
71.9
ArmoRM-Llama3-8B-v0.1
Backbone=Llama3-8B
2026.02
71.5
Feedback
Search any
task
Search any
task