LLM Alignment on Anthropic-HH (test)

57.53 GPT-4o Win Rate

DPPrefSyn

[Chart: GPT-4o Win Rate over time; y-axis ticks 28.6596, 36.1548, 43.65, 51.1452; x-axis tick May 29, 2026]
Updated 2d ago

Evaluation Results

Date       GPT-4o Win Rate
2026.05    57.53
2026.05    55.95
2026.05    55.08
2026.05    54.9
2026.05    38.72
2026.05    35
2026.05    31.98
2026.05    29.77