| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Classification | WildGuardMix (test) | F1 Score95.3 | 47 | |
| Computational Complexity Analysis | WildGuardMix 1.0 (test) | FLOPs (MFLOPs)0.004 | 40 | |
| Safety Monitoring | WildGuardMix (test) | Accuracy89.9 | 40 | |
| LLM Moderation | WildGuardMix (test) | ASR14.59 | 28 | |
| Safety Evaluation | WildGuardMix | Safety Score0.8974 | 22 | |
| Harmful prompt classification | WildGuardMix (val) | F1 Score98.34 | 20 | |
| Safety Classification | Wildguardmix | F1 Score76 | 15 | |
| Prompt Safety Detection | WildGuardMix (train) | AUROC0.8971 | 15 | |
| Prompt Safety Detection | WildGuardMix (test) | AUROC0.8882 | 15 | |
| Post-generation Inference | WildGuardMix LLaDA-2.0-mini (test) | Inference Time0.34 | 10 | |
| Post-generation Inference | WildGuardMix LLaDA-1.5 (test) | Inference Time0.36 | 10 | |
| Post-generation Inference | WildGuardMix LLaDA-8B-Instruct (test) | Inference Time0.31 | 10 | |
| Post-generation Inference | WildGuardMix LLaDA-8B-Base (test) | Inference Time0.57 | 10 | |
| Safety Classification | WildGuardMix-p (test) | F1 Score93.2 | 9 | |
| Safety Routing | WildGuardMix | Routing F154.34 | 5 | |
| Safety Routing | WildGuardMix-p | Routing F10.5054 | 5 | |
| Prompt-Response Safety Routing | WildGuardMix | Routing F161.41 | 5 | |
| Prompt-only Safety Routing | WildGuardMix-p | Routing F1 Score61.28 | 5 | |
| Safety Alignment | WildGuardMix | Win Rate55 | 5 | |
| Explainability classification | WildGuardMix human-annotated (test) | F1 Score60.69 | 3 | |
| Response Generation | WildGuardMix | Win Count61 | 3 |