Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA LLM Alignment benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
LLM Alignment
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
HelpSteer (test)
PD
AlpacaEval 2 WR
8.34
27
1mo ago
HH-RLHF (test)
RE-CONTROL + Prompting
Win Rate
80.3
21
1mo ago
UltraFeedback (test)
PD (ours)
AlpacaEval 2 Win Rate (WR)
21
18
1mo ago
HH-RLHF 300 prompts
CARDS
Win/Tie Rate vs Vanilla (GPT-4o)
69.8
16
1mo ago
SHP
RE-CONTROL + Prompting
Diversity
89.3
15
1mo ago
Taobao Live proprietary fine-grained preference dataset
PD (ours)
Win Score
1.53
13
1mo ago
Gemma-3-4B
Nash Prox
Win Rate
94.33
12
25d ago
AlpacaEval
AdaBoN
Percent Batches (BWR > 0.50)
100
12
1mo ago
PKU-SafeRLHF
AdaBoN
BWR (Median)
49
12
1mo ago
Alpaca, BeaverTails, and TruthfulQA (test)
AlignX
Win Rate
97.1
12
1mo ago
Combined Suite Setup 3
AMA Reweighting
Average Percentage Score
54.38
9
1mo ago
UltraFeedback (in-domain)
GEB-π
Win Rate (KL, alpha=1)
80.6
8
1mo ago
Honesty
AlignX
Truthfulness Index
84
7
1mo ago
Harmlessness
AlignX
WR
87.85
7
1mo ago
Helpfulness
AlignX
Truthfulness Index
0.891
7
1mo ago
Base Model Evaluation Set
AlignX
Win Rate
79.93
6
1mo ago
UltraFeedback 2023 (test)
MARS
Win-rate
55
4
1mo ago
PKU-SafeRLHF 2024 (test)
MARS
Win Rate
0.58
4
1mo ago
Anthropic HH-RLHF 2022 (test)
MARS
Win Rate
62
4
1mo ago
PKU-Safety (test)
DLMA-7B
Win Rate
58
2
1mo ago
HH-Harmless (test)
DLMA-7B
Win Rate
59
2
1mo ago
Showing 21 of 21 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs