| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Evaluation | DirectHarm 4 | Attack Success Rate 9 | 87 | |
| Safety Evaluation | DirectHarm | Harmfulness Score 5 | 84 | |
| Harmfulness Evaluation | DirectHarm | Harmfulness Score 5 | 56 | |
| Harmfulness Evaluation | DirectHarm (test) | Harmfulness Score (Llama-Guard-3B) 5 | 56 | |