| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Classification | WildGuardMix (test) | F1 (Unsafe)75.83 | 27 | |
| Safety Classification | Wildguardmix | F1 Score76 | 15 | |
| Prompt Safety Detection | WildGuardMix (train) | AUROC0.8971 | 15 | |
| Prompt Safety Detection | WildGuardMix (test) | AUROC0.8882 | 15 | |
| Safety Classification | WildGuardMix-p (test) | F1 Score93.2 | 9 | |
| Safety Routing | WildGuardMix | Routing F154.34 | 5 | |
| Safety Routing | WildGuardMix-p | Routing F10.5054 | 5 | |
| Prompt-Response Safety Routing | WildGuardMix | Routing F161.41 | 5 | |
| Prompt-only Safety Routing | WildGuardMix-p | Routing F1 Score61.28 | 5 | |
| Safety Alignment | WildGuardMix | Win Rate55 | 5 | |
| Explainability classification | WildGuardMix human-annotated (test) | F1 Score60.69 | 3 | |
| Response Generation | WildGuardMix | Win Count61 | 3 |