Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Prompt Harmfulness Detection on HarmTextVideo
Loading...
99.47
F1 Score
GuardReasoner-Omni 4B
48.8324
61.9787
75.125
88.2713
Feb 3, 2026
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
GuardReasoner-Omni 4B
Model Category=VLM Gua...
2026.02
99.47
GuardReasoner-Omni 2B
Model Category=VLM Gua...
2026.02
99.15
GuardReasoner-VL 7B
Model Category=VLM Gua...
2026.02
88.49
GuardReasoner-VL 3B
Model Category=VLM Gua...
2026.02
87.86
LLaMA Guard 4 12B
Model Category=VLM Gua...
2026.02
50.78
Feedback
Search any
task
Search any
task