Prompt Harmfulness Detection on UCF-crime
[Chart: F1 Score over time. Best result: GuardReasoner-Omni 4B, F1 Score 91.67, Feb 3, 2026.]
Updated 4d ago
Evaluation Results

| Method | Model Category | Links | F1 Score |
| GuardReasoner-Omni 4B | VLM Gua... | 2026.02 | 91.67 |
| GuardReasoner-Omni 2B | VLM Gua... | 2026.02 | 90.24 |
| HolmesVAU 2B | VAD Model | 2026.02 | 82.47 |
| GuardReasoner-VL 7B | VLM Gua... | 2026.02 | 73.75 |
| GuardReasoner-VL 3B | VLM Gua... | 2026.02 | 70.07 |
| HolmesVAD 7B | VAD Model | 2026.02 | 46.15 |
| LLaMA Guard 4 12B | VLM Gua... | 2026.02 | 9.33 |