LRM-specific Jailbreak Evaluation on HCoT
[Figure: ASR chart; labeled point: Base, ASR 100, May 9, 2026]
Updated 22d ago
Evaluation Results
| Method | Details | Date | ASR |
|---|---|---|---|
| Base | Model Architecture=Dee... | 2026.05 | 100 |
| SafeChain | Model Architecture=Dee... | 2026.05 | 100 |
| Base | Model Architecture=Dee... | 2026.05 | 100 |
| SafeChain | Model Architecture=Dee... | 2026.05 | 100 |
| STAR-1 | Model Architecture=Dee... | 2026.05 | 100 |
| Base | Model Architecture=Dee... | 2026.05 | 100 |
| SafeChain | Model Architecture=Dee... | 2026.05 | 100 |
| STAR-1 | Model Architecture=Dee... | 2026.05 | 100 |
| STAR-1 | Model Architecture=Dee... | 2026.05 | 98 |
| SInternal | Model Architecture=Dee... | 2026.05 | 94 |
| SInternal | Model Architecture=Dee... | 2026.05 | 92 |
| SInternal | Model Architecture=Dee... | 2026.05 | 90 |