Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Prompt Classification benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Prompt Classification
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
SimpST
PolyGuard
F1 Score
100
32
3mo ago
Aegis 2.0
NemotronReasoning
F1 Score
87.3
32
3mo ago
Aegis
NemotronReasoning
F1 Score
89.6
32
3mo ago
SEA-SafeguardBench
SEA-Guard-12B
AUPRC (Average)
93.6
29
3mo ago
EXPGUARD (test)
ShieldGemma
Financial Performance Score
0
28
3mo ago
SEA-SafeguardBench English
SEA-Guard-12B
AUPRC
98.9
18
3mo ago
XSTest
WildGuard
F1 Score
94.8
16
3mo ago
WildG
Qwen3Guard-Gen
F1 Score
88.5
16
3mo ago
ToxiC
Qwen3Guard-Gen
F1 Score
81.9
16
3mo ago
SorryB
PolyGuard
F1 Score
97.2
16
3mo ago
SEval
YuFeng-XGuard
F1 Score
92.5
16
3mo ago
OverR
YuFeng-XGuard
F1 Score
41.9
16
3mo ago
OpenAIM
Qwen3Guard-Gen
F1
81.1
16
3mo ago
XSTest Text Prompt
GuardReasoner-8B
F1 Score
93.71
14
3mo ago
WildGuard Text Prompt
GuardReasonerVL-7B
F1 Score
90.46
14
3mo ago
ToxicChat Text Prompt
GPT4o-mini
F1 Score
96.27
14
3mo ago
Simple SafetyTest Text Prompt
Gemini2.5-Flash
F1 Score
100
14
3mo ago
OpenAI Moderation Text Prompt
LlamaGuard2-8B
F1 Score
88.89
14
3mo ago
Aegis Text Prompt 2.0
GPT4o-mini
F1 Score
83.52
14
3mo ago
HarmBench Text Prompt
GPT-OSS-SafeGuard-20B
F1 Score
98.85
14
3mo ago
SEALS (SEA)
SEA-Guard-12B
AUPRC
96.9
9
3mo ago
LlavaGuard Image Prompt
GPT4o-mini
F1 Score
0.752
7
3mo ago
BeaverTails-V Text-Image Prompt
ProGuard-7B
F1 Score
88.36
7
3mo ago
Showing 23 of 23 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs