Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Safety on Safety Evaluation Suite
Loading...
0.911
Score
OLMo 2 7B Inst
0.703
0.757
0.811
0.865
Dec 15, 2025
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
OLMo 2 7B Inst
stage=Instruct
2025.12
0.911
Olmo 3 7B Instruct
stage=DPO
2025.12
0.899
Olmo 3 7B Instruct
stage=SFT
2025.12
0.895
Olmo 3 7B Instruct
stage=Final Instruct
2025.12
0.876
Qwen 3 8B
stage=Instruct
2025.12
0.784
Qwen 3 VL 8B Inst
stage=Instruct
2025.12
0.777
Granite 3.3 8B Inst
stage=Instruct
2025.12
0.743
Qwen 2.5 7B
stage=Instruct
2025.12
0.734
Apertus 8B Inst
stage=Instruct
2025.12
0.711
Feedback
Search any
task
Search any
task