Harmfulness Evaluation on HH Harmless
Top entry: DLMA, Beaver-7B Cost Score 3.25 (Feb 19, 2024)
[Chart: Beaver-7B Cost Score by date]
Evaluation Results
Method   Model Size   Date      Beaver-7B Cost Score
DLMA     13B          2024.02   3.25
RCLD     13B          2024.02   3.89
CD       13B          2024.02   4.15
DLMA     7B           2024.02   4.69
RCLD     7B           2024.02   5.04
CD       7B           2024.02   5.45
RLAIF    13B          2024.02   8.32
RLAIF    7B           2024.02   9.39
Llama2   7B           2024.02   9.75
Llama2   13B          2024.02   10.04