| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Jailbreaking Safety Evaluation | Fortress | Safety Score87.7 | 30 | |
| Jailbreak Attack Defense | FORTRESS | ASR9.8 | 24 | |
| Harmful Content Detection | Fortress | ASR18.6 | 12 | |
| Safety Evaluation | Fortress | JailBreak Score2.8 | 12 | |
| Overrefusal Evaluation | Fortress OR | Helpfulness Score97.6 | 12 | |
| Non-Agentic Performance Evaluation | Fortress (test) | Mean Score78.75 | 4 | |
| Safety Evaluation | Fortress | Cost per Accuracy Point ($)0.0016 | 4 |