Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

UltraFeedback

Benchmarks

Task NameDataset NameSOTA ResultTrend
RLHF AlignmentUltraFeedback In-domain v1 (test)
Win Rate81
46
MT-BenchUltraFeedback
MT-Bench Score8.1
42
AlpacaEval 2.0UltraFeedback
LC30
42
Reward ModelingUltraFeedback (test)
MAE0.145
38
Controllable GenerationCode-UltraFeedback
Diversity90.7
36
Generative PerformanceUltrafeedback 61.1k (test)
Win Rate69.8
30
Discriminative PerformanceUltrafeedback 61.1k (test)
Accuracy73.05
30
Response GenerationUltraFeedback (val)
BERTScore88.1
24
LLM JudgmentUltraFeedback
Accuracy68.75
23
Multi-turn Conversation EvaluationUltraFeedback
MT-Bench Score6.1
20
Sequential Preference OptimizationUltraFeedback
Harmless Rate99.71
20
AlignmentUltraFeedback (test)
Honesty Score63.72
20
Best-of-N Reward EvaluationUltraFeedback core250
Reward Score24.323
18
Reward ModelingUltraFeedback core250 (held-out evaluation)
Delta (Δ)3.543
18
LLM AlignmentUltraFeedback (test)
AlpacaEval 2 Win Rate (WR)21
18
Reward Model TransferUltraFeedback (UF)
AOG7.93
16
Preference AlignmentUltraFeedback
Win Rate81
16
Instruction FollowingUltraFeedback (core250)
Delta Preference Score (bo64)12.568
15
Pairwise Judge ComparisonUltraFeedback core250
Win Count (W)161
14
Preference EvaluationUltraFeedback core250 (test)
Win Rate80
12
Preference AlignmentUltrafeedback 40% flipping ratio
Accuracy78.87
12
Preference AlignmentUltrafeedback 20% flipping ratio
Accuracy78.8
12
Preference AlignmentUltraFeedback (test)
Accuracy74.18
11
Direct Preference OptimizationUltraFeedback
Accuracy69.92
11
Multi-agent ReasoningUltrafeedback
Accuracy73.66
9
Showing 25 of 58 rows