Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
LLM Safety Defense on BeaverTails
Loading...
10.39
ASR
MTSA-T3
9.8256
13.6353
17.445
21.2547
May 22, 2025
ASR
Updated 4d ago
Evaluation Results
Method
Method
Links
ASR
MTSA-T3
Target Model=Llama2-7B...
2025.05
10.39
MTSA-T3
Target Model=Zephyr-7B...
2025.05
11.5
Baseline
Target Model=Llama2-7B...
2025.05
21.5
Baseline
Target Model=Zephyr-7B...
2025.05
24.5
Feedback
Search any
task
Search any
task