| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Jailbreak Defense | PAIR | Harmful Score: 1 | 37 |
| Jailbreak Detection | PAIR | Accuracy: 98 | 30 |
| Jailbreak Attack | PAIR | ASR: 76 | 27 |
| Jailbreak Attack Defense | PAIR | ASR: 1 | 24 |
| Adversarial Robustness | PAIR | ASR: 26 | 18 |
| Cell Segmentation | Pair 5 (test) | SEG Score: 0.76 | 9 |
| Cell Segmentation | Pair 4 (test) | Segmentation Score: 90 | 9 |
| Cell Segmentation | Pair 3 (test) | SEG Score: 0.81 | 9 |
| Cell Segmentation | Pair 1 (test) | SEG: 62 | 9 |