Prompt Harmfulness Detection on Text & Image Benchmarks Average

83.84 F1 Score

GuardReasoner-Omni 2B

Updated 4mo ago

Evaluation Results

Method	Links	F1 Score
GuardReasoner-Omni 2B 2026.02		83.84
Qwen3Guard Gen 8B (loose) 2026.02		83.6
GuardReasoner-Omni 4B 2026.02		83.03
GuardReasoner 8B 2026.02		81.09
PolyGuard-Qwen 7B 2026.02		79.53
GuardReasoner-VL 7B 2026.02		79.36
GuardReasoner-Omni 2B 2026.02		79.1
GuardReasoner-VL 3B 2026.02		78.67
GuardReasoner-VL 7B 2026.02		78.54
WildGuard 7B 2026.02		77.99
GuardReasoner-VL 3B 2026.02		77.65
GuardReasoner-Omni 4B 2026.02		77.17
Qwen3Guard Gen 8B (strict) 2026.02		76.41
Aegis Guard Permissive 7B 2026.02		73.83
Aegis Guard Defensive 7B 2026.02		72.99
ShieldGemma 9B 2026.02		68.77
LLaMA Guard 3 8B 2026.02		68.47
LLaMA Guard 4 12B 2026.02		51.05
LLaMA Guard 4 12B 2026.02		42.27