| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| MM-SafetyBench | CASA | Attack Success Rate (ASR) | 0.2 | 56 | 2mo ago |
| FORTRESS | ReasoningGuard | ASR | 9.8 | 24 | 3mo ago |
| PAIR | Self-Reminder | ASR | 1 | 24 | 27d ago |
| GCG | STAR-1 | ASR | 0 | 24 | 27d ago |
| Mousetrap | Paraphrase | FFR (Reasoning) | 22.7 | 17 | 27d ago |
| AutoRAN | ThinkingI | Reasoning Failure Rate (FFR) | 0 | 17 | 27d ago |
| SorryBench | Paraphrase | FFR (Reasoning) | 56.8 | 17 | 27d ago |
| FigStep, MM-SafetyBench, SPA-VL Average | VSFA | Attack Success Rate (ASR) | 14.18 | 16 | 2mo ago |
| SPA-VL | VSFA | Attack Success Rate (ASR) | 22.64 | 16 | 2mo ago |
| JailbreakV-28K v1 (test) | | Defense Success Rate (Noise - T) | 38.16 | 6 | 3mo ago |