Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
LRM-specific Jailbreak Evaluation on Trotter
Loading...
54
ASR
Base
18.328
27.589
36.85
46.111
May 9, 2026
ASR
Updated 22d ago
Evaluation Results
Method
Method
Links
ASR
Base
Model Architecture=Dee...
2026.05
54
STAR-1
Model Architecture=Dee...
2026.05
49
SafeChain
Model Architecture=Dee...
2026.05
44.4
Base
Model Architecture=Dee...
2026.05
37.9
STAR-1
Model Architecture=Dee...
2026.05
34.3
SInternal
Model Architecture=Dee...
2026.05
33.3
STAR-1
Model Architecture=Dee...
2026.05
32.3
SafeChain
Model Architecture=Dee...
2026.05
29.8
SafeChain
Model Architecture=Dee...
2026.05
29.8
SInternal
Model Architecture=Dee...
2026.05
26.3
Base
Model Architecture=Dee...
2026.05
23.7
SInternal
Model Architecture=Dee...
2026.05
19.7
Feedback
Search any
task
Search any
task