Safety Alignment Evaluation on StrongReject SR-PAP_M
[Chart: Safety Rate over time. Leading method: MESA, Safety Rate 100, as of May 30, 2026.]
Evaluation Results
| Method | Details | Date | Safety Rate |
|---|---|---|---|
| MESA | Architecture=DeepSeek-... | 2026.05 | 100 |
| GRPO | Architecture=Qwen3-30B... | 2026.05 | 100 |
| Stair-SFT | Architecture=Qwen3-30B... | 2026.05 | 100 |
| Stair-DPO | Architecture=Qwen3-30B... | 2026.05 | 100 |
| MESA | Architecture=Qwen3-30B... | 2026.05 | 100 |
| SafeX | Architecture=Qwen3-30B... | 2026.05 | 99.68 |
| Stair-DPO | Architecture=DeepSeek-... | 2026.05 | 99.04 |
| SFT | Architecture=Qwen3-30B... | 2026.05 | 98.72 |
| Stair-SFT | Architecture=DeepSeek-... | 2026.05 | 97.76 |
| SFT | Architecture=DeepSeek-... | 2026.05 | 96.81 |
| SafeX | Architecture=DeepSeek-... | 2026.05 | 93.93 |
| Base(instruct) | Architecture=Qwen3-30B... | 2026.05 | 92.97 |
| GRPO | Architecture=DeepSeek-... | 2026.05 | 71.89 |
| Base(chat) | Architecture=DeepSeek-... | 2026.05 | 70.93 |