Share your thoughts, 1 month free Claude Pro on usSee more

Reward Modeling on RM-Bench Easy

92.2Accuracy

Llama-3.1-Nemotron-70B

Updated 4mo ago

Evaluation Results

Method	Links
Llama-3.1-Nemotron-70B 2026.02		92.2
INF-ORM-Llama3.1-70B 2026.02		92.1
Athene-RM-8B 2026.02		89.8
Skywork-Reward-Gemma-2-27B-v0.2 2026.02		88.9
Llama-3-OffsetBias-RM-8B 2026.02		83.9
WILDREWARD-8B 2026.02		83.5
WILDREWARD-4B 2026.02		82
ArmoRM-Llama3-8B-v0.1 2026.02		80.4
Internlm2-20b-reward 2026.02		79.4
Skywork-Reward-Llama-3.1-8B-v0.2 2026.02		70.5