Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reward Modeling on Arabic preference (test)

85.4Accuracy

RM-Distiller-Qwen2.5-3B-Instruct

71.67275.23678.882.364Jan 20, 2026
Updated 4d ago

Evaluation Results

MethodLinks
85.4
2026.01
83.2
2026.01
83.2
2026.01
81.3
2026.01
76.7
2026.01
75.9
2026.01
72.2