Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Preference Evaluation on PKU-SafeRLHF
Loading...
57
Win Rate
DLMA-13B
41.4
45.45
49.5
53.55
Feb 19, 2024
Win Rate
Lose Rate
Tie Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Win Rate
Lose Rate
Tie Rate
DLMA-13B
Comparison Baseline=Ll...
2024.02
57
8
35
DLMA-7B
Comparison Baseline=RL...
2024.02
56
8
36
DLMA-7B
Comparison Baseline=Ll...
2024.02
55
8
37
DLMA-13B
Comparison Baseline=RL...
2024.02
55
11
34
DLMA-13B
Comparison Baseline=CD...
2024.02
49
16
45
DLMA-7B
Comparison Baseline=RL...
2024.02
43
25
32
DLMA-13B
Comparison Baseline=RL...
2024.02
43
24
33
DLMA-7B
Comparison Baseline=CD-7B
2024.02
42
15
43
Feedback
Search any
task
Search any
task