Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Safety Evaluation on StrongReject (Harmful Score)
Loading...
0
Harmful Score
Base
-0.128
0.736
1.6
2.464
May 9, 2026
Harmful Score
Updated 22d ago
Evaluation Results
Method
Method
Links
Harmful Score
Base
Model Backbone=DeepSee...
2026.05
0
STAR-1
Model Backbone=DeepSee...
2026.05
0
SInternal
Model Backbone=DeepSee...
2026.05
0
Base
Model Backbone=DeepSee...
2026.05
0
SafeChain
Model Backbone=DeepSee...
2026.05
0
STAR-1
Model Backbone=DeepSee...
2026.05
0
SInternal
Model Backbone=DeepSee...
2026.05
0
SInternal
Model Backbone=DeepSee...
2026.05
0.3
SafeChain
Model Backbone=DeepSee...
2026.05
0.3
SafeChain
Model Backbone=DeepSee...
2026.05
1
STAR-1
Model Backbone=DeepSee...
2026.05
1
Base
Model Backbone=DeepSee...
2026.05
3.2
Feedback
Search any
task
Search any
task