LLM Jailbreaking on Llama3 TAR
Chart: Success Rate First (SRF). Best result: Adaptive Probe-based Steering, 32 (May 19, 2026).
Updated 13d ago
Evaluation Results
Method                         Date     Success Rate First (SRF)  Harmful Behavior (HB)  Success Rate (SR)
Adaptive Probe-based Steering  2026.05  32                        24                     50
Angular                        2026.05  25                        22                     40
RD-A                           2026.05  19                        14                     27
RD-C                           2026.05  15                        16                     27
SCAV                           2026.05  1                         1                      1
RepE                           2026.05  0                         0                      0