LlavaGuard

Benchmarks

Task Name	Dataset Name	SOTA Result
Unsafe content detection	LlavaGuard	Accuracy83.81	21
Policy Responsiveness	LlavaGuard v1 (test)	PER98.75	20
Visual Compliance Verification	LlavaGuard 1290 samples (test)	Unsafe F1 Score93	13
Content safety guardrailing	LlavaGuard	Score (%)87.5	12
Prompt Classification	LlavaGuard Image Prompt	F1 Score0.752	7
Out-of-Taxonomy Risk Detection	LlavaGuard	F1 Score66.87	4
OOD safety category inference (Stage 2)	LlavaGuard	Reward Mean13.28	4

Showing 7 of 7 rows