Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Preference Evaluation on HH-Helpful
Loading...
52
Win Count
DLMA-13B
38.48
41.99
45.5
49.01
Feb 19, 2024
Win Count
Lose Count
Tie Count
Updated 4d ago
Evaluation Results
Method
Method
Links
Win Count
Lose Count
Tie Count
DLMA-13B
Comparison Baseline=Ll...
2024.02
52
14
34
DLMA-7B
Comparison Baseline=RL...
2024.02
48
14
38
DLMA-13B
Comparison Baseline=RL...
2024.02
47
18
35
DLMA-7B
Comparison Baseline=Ll...
2024.02
46
15
39
DLMA-13B
Comparison Baseline=CD...
2024.02
46
21
33
DLMA-7B
Comparison Baseline=CD-7B
2024.02
43
18
39
DLMA-13B
Comparison Baseline=RL...
2024.02
41
20
39
DLMA-7B
Comparison Baseline=RL...
2024.02
39
21
40
Feedback
Search any
task
Search any
task