Red-teaming Safety Evaluation on XSTEST
[Chart: HPR. Highest entry: Meta-Llama-3.1-8B (Unaligned), HPR 61, May 30, 2025. Metrics shown: HPR, HS, ASR.]
Evaluation Results
Method                                   | Parameters                | Date    | HPR | HS   | ASR
Meta-Llama-3.1-8B (Unaligned)            | base_model=Meta-Llama-... | 2025.05 | 61  | 3.04 | 41
Meta-Llama-3.1-8B (HH_RLHF-aligned)      | base_model=Meta-Llama-... | 2025.05 | 50  | 3.27 | 35
Meta-Llama-3.1-8B (TRIDENT-EDGE-aligned) | base_model=Meta-Llama-... | 2025.05 | 40  | 2.02 | 3
Meta-Llama-3.1-8B (SAFE_RLHF-aligned)    | base_model=Meta-Llama-... | 2025.05 | 39  | 2.34 | 6
Meta-Llama-3.1-8B (WILDBREAK-aligned)    | base_model=Meta-Llama-... | 2025.05 | 38  | 2.19 | 8
Meta-Llama-3.1-8B (WILDCHAT-aligned)     | base_model=Meta-Llama-... | 2025.05 | 34  | 2.23 | 11
Meta-Llama-3.1-8B (AART-aligned)         | base_model=Meta-Llama-... | 2025.05 | 27  | 2.08 | 11
Meta-Llama-3.1-8B (ATTAQ-aligned)        | base_model=Meta-Llama-... | 2025.05 | 23  | 2.24 | 16