| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Evaluation | 901 Safety Prompts (test) | Average Rank4.1337 | 11 | |
| Safety Assessment | Safety Prompts (randomly selected 200 samples per field) | Insensitivity Score1.5 | 9 | |
| Attack Success Rate Evaluation | HRL/LRL Safety Prompts English Multi-Image v1 | ASR2 | 6 | |
| Attack Success Rate Evaluation | HRL/LRL Safety Prompts English Text v1 | ASR1 | 6 |