| Dataset Name | SOTA Method | Metric | Score | Trend | Updated |
|---|---|---|---|---|---|
| BeaverTails V Text-Image Response | GuardReasoner-8B | F1 Score | 84.02 | 23 | 2d ago |
| XSTest Text Response | GuardReasoner-8B | F1 Score | 98.43 | 16 | 2d ago |
| Wild Guard Text Response | DynaGuard-8B | F1 Score | 93.17 | 16 | 2d ago |
| Aegis Text Response 2.0 | ProGuard-7B | F1 Score | 82.27 | 16 | 2d ago |
| Generic Response Classification Suite (Aegis2.0, Beavertails, SEval, SafeRLHF, Think, WildG, XSTest) | Qwen3Guard-Gen-4B | Aegis2.0 | 86.5 | 16 | 4d ago |
| SEA-SafeguardBench CG Cultural | SEA-Guard | AUPRC (English) | 75.4 | 16 | 4d ago |
| SafeQA English | Qwen3Guard-Gen 8B | AUPRC | 97.7 | 9 | 4d ago |
| SEA-SafeguardBench | Qwen3Guard-Gen 8B | AUPRC | 89.7 | 9 | 4d ago |
| SEA-SafeguardBench English | LlamaGuard-3 8B | AUPRC | 92.1 | 9 | 4d ago |