Unsafe content detection on VLGuard
[Chart: F1 Score and Accuracy over time. Top F1 Score: 79.3 (OMNIGUARD-3B, Dec 2, 2025)]
Updated 4d ago
Evaluation Results
| Method | Size | Date | F1 Score | Accuracy |
|---|---|---|---|---|
| OMNIGUARD-3B | 3B | 2025.12 | 79.3 | 81.9 |
| OMNIGUARD-7B | 7B | 2025.12 | 79.1 | 81.7 |
| Qwen2.5-VL-72B | 72B | 2025.12 | 78.5 | 77.4 |
| Qwen3-VL-235B | 235B | 2025.12 | 77.4 | 76.5 |
| GPT-4o | - | 2025.12 | 75.5 | 79.7 |
| LlavaGuard-v1.2-7B | 7B | 2025.12 | 69.8 | 74 |
| Qwen2.5-Omni-7B | 7B | 2025.12 | 64.4 | 70.8 |
| Qwen2.5-VL-7B | 7B | 2025.12 | 62.8 | 48.2 |
| LLaMA Guard 3V | 11B | 2025.12 | 0 | 55.8 |
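The two reported metrics can rank methods differently, and an F1 Score of 0 can coexist with a non-zero Accuracy. The sketch below shows how both are commonly computed for a binary classifier. It assumes that "unsafe" is the positive class and that F1 is the standard binary F1; the page does not state either, and the labels used here are made-up toy data, not VLGuard results. `binary_metrics` is a hypothetical helper name.

```python
def binary_metrics(y_true, y_pred):
    """Return (accuracy, precision, recall, f1) for binary labels.

    Assumption: 1 = unsafe (positive class), 0 = safe.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # unsafe, flagged
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # safe, not flagged
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # safe, flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # unsafe, missed

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return accuracy, precision, recall, f1


# Toy data (illustrative only): 2 unsafe and 3 safe examples.
y_true = [1, 1, 0, 0, 0]

# A classifier that never flags anything: accuracy stays at 0.6, F1 drops to 0.
never_flags = [0, 0, 0, 0, 0]
acc, _, _, f1 = binary_metrics(y_true, never_flags)
print(f"never flags: accuracy={acc:.2f} f1={f1:.2f}")

# A classifier with one hit, one miss, one false alarm: same accuracy, F1 of 0.5.
mixed = [1, 0, 0, 0, 1]
acc, _, _, f1 = binary_metrics(y_true, mixed)
print(f"mixed:       accuracy={acc:.2f} f1={f1:.2f}")
```

Both toy classifiers reach the same accuracy while their F1 differs, which is one reason the leaderboard lists the two columns separately. Whether this explains any specific row above is not something the page confirms.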