| SafeRLHF | | F1 Score 0.94 | | 32 | 4d ago |
| WildGuardMix (test) | LEG large | F1 (Unsafe) 75.83 | | 27 | 4d ago |
| XSTest (test) | LEG large | F1 92.91 | | 20 | 4d ago |
| XSTest | AprielGuard | F1 Score 94 | | 16 | 4d ago |
| WildGuardMix | AprielGuard | F1 Score 76 | | 15 | 4d ago |
| HarmBench | IBM Granite Guardian 3.2 | Recall 100 | | 14 | 4d ago |
| AegisSafetyTest V2 | | F1 Score 87 | | 14 | 4d ago |
| AegisSafety V1 (test) | | F1 Score 92 | | 14 | 4d ago |
| ToxicChat | | F1 Score 0.81 | | 14 | 4d ago |
| XSTestResponse | AprielGuard | F1 Score 0.96 | | 14 | 4d ago |
| Aya Redteaming | IBM Granite Guardian 3.1 | Recall 94 | | 14 | 4d ago |
| SimpleSafetyTests | IBM Granite Guardian 3.2 | Recall 100 | | 14 | 4d ago |
| ToxicChat (out-of-distribution) | Multi-head self-attn | F1 Score 72.88 | | 11 | 4d ago |
| HarmBench (test) | | F1 Score 90.5 | | 9 | 4d ago |
| OAI (test) | | F1 Score 86.5 | | 9 | 4d ago |
| WildGuardMix-p (test) | | F1 Score 93.2 | | 9 | 4d ago |
| DiaSafety (test) | GAUGE-mean | AUROC 66.98 | | 8 | 4d ago |
| OS Bench | CLUE | Recall 0.936 | | 8 | 4d ago |
| 3,000 Polish user prompts (test) | Bielik Guard 0.1B v1.1 | Precision 77.65 | | 7 | 4d ago |
| MindGuard (test) | MindGuard 8B | AUROC 98.2 | | 6 | 4d ago |
| OR-Bench | Mistral | F1 Score 77 | | 3 | 4d ago |
| AdvBench | Llama 3 | F1 Score 84 | | 3 | 4d ago |
| PTP | CREST-LARGE | F1 Score 81.28 | | 2 | 4d ago |
| RTP-LX | CREST-LARGE | F1 Score 79.86 | | 2 | 4d ago |
| MultiJail | CREST-BASE | F1 Score 0.9335 | | 2 | 4d ago |