Judge Accuracy on RMB-SafeRLHF (Bench)

85.6 Accuracy

Biased Rubric Search

[Chart: accuracy over time. Y-axis ticks: 68.856, 73.203, 77.55, 81.897. X-axis: Feb 14, 2026]
Updated 4d ago

Evaluation Results

Date     Accuracy
2026.02  85.6
2026.02  81.7
2026.02  70.3
2026.02  69.5