Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Preference Learning on Anthropic HH-RLHF+VI Preference (test)

64Overall Accuracy

MC-STL

58.860.1561.562.85Jan 10, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
64641.02-0.010.61
2026.01
60590.940.030.58
2026.01
59560.940.020.58