Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Red-teaming on HarmBench (test)
Loading...
85.1
ASR@1
DIALTREE
20.1
36.975
53.85
70.725
Oct 2, 2025
ASR@1
ASR@3
ASR@5
Updated 1mo ago
Evaluation Results
Method
Method
Links
ASR@1
ASR@3
ASR@5
DIALTREE
Target LLMs=Average ac...
2025.10
85.1
98.6
99.5
X-Teaming
Target LLMs=Average ac...
2025.10
44.9
69.6
78.9
ActorAttack
Target LLMs=Average ac...
2025.10
22.6
38.5
45.1
Feedback
Search any
task
Search any
task