Share your thoughts, 1 month free Claude Pro on usSee more

Safety Evaluation on XSTest

98.4Safety Score

Base

Updated 1mo ago

Evaluation Results

Method	Links
Base 2025.09		98.4
Qwen2.5 Instruct (72B) 2025.01		97.9
Qwen3.5-9B 2026.05		97.6
GPT-4o (2024-05-13) 2025.01		97.3
DeepSeek-V3 2025.01		97.1
o1 (2024-12-17) 2025.01		97
Qwen3.5-4B 2026.05		96.8
Ministral-3-14B 2026.05		96.8
Claude-3.7-Sonnet 2025.01		96.4
DeepSeek-R1 2025.01		95.3
DeepSeek-R1 2025.01		94.4
IQuest-Coder-V1-40B-Thinking 2026.03		94.3
OLMo-3-7B 2026.05		93.2
Mellum 2 (SFT) 2026.05		90.8
Qwen2.5-Coder-32B-Instruct 2026.03		90.6
Qwen3-Coder-480B-A35B-Instruct 2026.03		90.1
Mellum 2 (RL) 2026.05		89.6
IQuest-Coder-V1-40B-Instruct 2026.03		89.3
Self-Improving Pretraining 2026.01		88.4
TARS 2025.09		88.3
Llama Pretrain Baseline 2026.01		87.6
GRPO 2025.09		86.8
Llama Base 2026.01		85.2
BackTrack 2025.09		80
IPO 2025.09		80
STAR 2025.09		76.9
Self-Improving Pretraining 2026.01		49
Llama Base 2026.01		39.5
Llama Pretrain Baseline 2026.01		35
PCA-HMM 2026.05		15
PCA-HMM 2026.05		9.5
PCA-HMM 2026.05		6