Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Red-teaming on HarmBench Llama-3-8B (test)
Loading...
0.98
ASR
AGENTICRED
0.6056
0.7028
0.8
0.8972
Jan 20, 2026
ASR
Updated 1mo ago
Evaluation Results
Method
Method
Links
ASR
AGENTICRED
2026.01
0.98
AdvReasoning
2026.01
0.88
TransferAttack
2026.01
0.85
CoP
2026.01
0.71
AutoDAN-Turbo
2026.01
0.62
Feedback
Search any
task
Search any
task