LLM Alignment on OpenAssistant (test)
[Chart: GPT-4o Win Rate over time. Highlighted point: DPPrefSyn, 11.85, May 29, 2026.]
Evaluation Results
Method      Settings                    Date      GPT-4o Win Rate
DPPrefSyn   Privacy budget (ε)=4,...    2026.05   11.85
DPPrefSyn   Privacy budget (ε)=4,...    2026.05   10.11
DPPrefSyn   Privacy budget (ε)=4,...    2026.05   9.33
DP-FT       Privacy budget (ε)=∞,...    2026.05   8.2
DP-FT       Privacy budget (ε)=∞,...    2026.05   7.08
DP-FT       Privacy budget (ε)=∞,...    2026.05   4.75
Base LLM    Privacy budget (ε)=0        2026.05   2.11