Prompt Harmfulness Detection

Benchmarks

Dataset Name	SOTA Method	Metric	Score	Count	Updated
AegisSafety (test)	LLaMA Guard 3	F1 Score	99.5	41	1mo ago
Text & Image Benchmarks Average	GuardReasoner-Omni 2B	F1 Score	83.84	19	4mo ago
Combined Prompt Harmfulness Suite (ToxicChat, HarmBench, OpenAI Mod, Aegis SafetyTest, WildGuard Test)	GuardReasoner	Macro Avg Harmfulness Detection Rate	84.4	17	1mo ago
HarmVideo	GuardReasoner-Omni 4B	F1 Score	95.5	7	4mo ago
FVC	GuardReasoner-Omni 4B	F1 Score	67.86	7	4mo ago
XD-Violence	GuardReasoner-Omni 2B	F1 Score	96.82	7	4mo ago
UCF-crime	GuardReasoner-Omni 4B	F1 Score	91.67	7	4mo ago
Video Benchmarks Average	GuardReasoner-Omni 4B	F1 Score	94.84	5	4mo ago
HarmTextVideo	GuardReasoner-Omni 4B	F1 Score	99.47	5	4mo ago
