Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Safety Alignment on AdvBench (HarmRate)
Loading...
0
Harm Rate
BoN64
-1.52
8.74
19
29.26
May 26, 2025
Harm Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Harm Rate
BoN64
Base Model=LLaMA2-13B-...
2025.05
0
SEA
Base Model=LLaMA2-13B-...
2025.05
0
SFT
Base Model=LLaMA2-13B-...
2025.05
38
Feedback
Search any
task
Search any
task