Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Safety Evaluation on SIUO
Loading...
77.4
Safe Rate
Safework-R1-7B
19.3264
34.4032
49.48
64.5568
Oct 16, 2024
Dec 24, 2024
Mar 4, 2025
May 13, 2025
Jul 21, 2025
Sep 29, 2025
Dec 8, 2025
Safe Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Safe Rate
Safework-R1-7B
Model=Safework-R1-7B
2025.12
77.4
Gemini-2.5-pro
Model=Gemini-2.5-pro
2025.12
76.7
TRR
Backbone=Qwen2.5VL-7B,...
2025.12
76.2
MSR-Align
Backbone=Qwen2.5VL-32B...
2025.12
73.2
TRR
Backbone=Qwen2.5VL-32B...
2025.12
71.5
MSR-Align
Backbone=Qwen2.5VL-7B,...
2025.12
70.7
TiS
Backbone=Qwen2.5VL-32B...
2025.12
67.1
Claude-3.5-Sonnet
Model=Claude-3.5-Sonnet
2025.12
56.7
GPT-4o
Model=GPT-4o
2025.12
51.8
Qwen2.5VL-32B
Backbone=Qwen2.5VL-32B...
2025.12
42.7
TiS
Backbone=Qwen2.5VL-7B,...
2025.12
37.8
Qwen2.5VL-7B
Backbone=Qwen2.5VL-7B,...
2025.12
30.8
TGA
Model Size=7B
2024.10
30.77
LLaVA-v1.5
Model Size=13B
2024.10
22.16
LLaVA-v1.5
Model Size=7B
2024.10
21.56
Feedback
Search any
task
Search any
task