Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
LLM Alignment on Anthropic-HH (test)
Loading...
57.53
GPT-4o Win Rate
DPPrefSyn
28.6596
36.1548
43.65
51.1452
May 29, 2026
GPT-4o Win Rate
Updated 2d ago
Evaluation Results
Method
Method
Links
GPT-4o Win Rate
DPPrefSyn
Data Type=Synthetic, P...
2026.05
57.53
DPPrefSyn
Data Type=Synthetic, P...
2026.05
55.95
DPPrefSyn
Data Type=Synthetic, P...
2026.05
55.08
DPPrefSyn
Data Type=Synthetic, P...
2026.05
54.9
DP-FT
Data Type=Original, Pr...
2026.05
38.72
DP-FT
Data Type=Original, Pr...
2026.05
35
DP-FT
Data Type=Original, Pr...
2026.05
31.98
DP-FT
Data Type=Original, Pr...
2026.05
29.77
Feedback
Search any
task
Search any
task