Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reward Modeling on IFBench (test)

57.9Accuracy

RM-Distiller

36.0641.7347.453.07Jan 20, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
57.9
2026.01
57.7
2026.01
54.2
2026.01
52.5
2026.01
50.8
2026.01
50.2
36.9