| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Policy Responsiveness | LlavaGuard v1 (test) | PER98.75 | 20 | |
| Unsafe content detection | LlavaGuard | Accuracy82 | 14 | |
| Visual Compliance Verification | LlavaGuard 1290 samples (test) | Unsafe F1 Score93 | 13 | |
| Prompt Classification | LlavaGuard Image Prompt | F1 Score0.752 | 7 | |
| Out-of-Taxonomy Risk Detection | LlavaGuard | F1 Score66.87 | 4 | |
| OOD safety category inference (Stage 2) | LlavaGuard | Reward Mean13.28 | 4 |