Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LlavaGuard

Benchmarks

Task NameDataset NameSOTA ResultTrend
Unsafe content detectionLlavaGuard
Accuracy82
14
Prompt ClassificationLlavaGuard Image Prompt
F1 Score0.752
7
Out-of-Taxonomy Risk DetectionLlavaGuard
F1 Score66.87
4
OOD safety category inference (Stage 2)LlavaGuard
Reward Mean13.28
4
Showing 4 of 4 rows