| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Evaluation | VLGuard | ASR (Before)0.23 | 24 | |
| Vision-text safety classification | VLGuard | AUPRC (Prompt)0.8843 | 9 | |
| Unsafe content detection | VLGuard | F1 Score79.3 | 9 | |
| Multimodal Safety Evaluation | VLGuard (test) | Accuracy86.78 | 6 | |
| Multimodal Jailbreaking | VLGuard Unsafe (OOD) | ASR66.7 | 6 | |
| Over-Prudence Evaluation | VLGuard | RR (Before)4.48 | 6 | |
| Jailbreak Attack | VLGuard Safe | Attack Success Rate (ASR)8.44 | 5 | |
| Jailbreak Attack | VLGuard Image Unsafe | ASR52.49 | 5 | |
| Jailbreak Attack | VLGuard Text Unsafe | ASR34.59 | 5 | |
| Jailbreak Attack | VLGuard (All) | ASR17.88 | 5 |