LLM Jailbreaking on Mistral-RB
[Chart: Adaptive Probe-based Steering, SRF 58, May 19, 2026. Metrics: SRF, HB, SR]
Updated 13d ago
Evaluation Results
Method                          Date      SRF   HB   SR
Adaptive Probe-based Steering   2026.05    58   63   85
RD-C                            2026.05    23   25   34
RD-A                            2026.05     7    3    7
Angular                         2026.05     7    1   12
RepE                            2026.05     4    1    4
SCAV                            2026.05     1    3    0