Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reward Modeling on Arabic preference (test)

85.4Accuracy

RM-Distiller-Qwen2.5-3B-Instruct

71.67275.23678.882.364Jan 20, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
85.4
2026.01
83.2
2026.01
83.2
2026.01
81.3
2026.01
76.7
2026.01
75.9
2026.01
72.2