SOTA Prompt Classification benchmarks and papers with code | Wizwand
Prompt Classification
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
|---|---|---|---|---|---|
| SimpST | PolyGuard | F1 Score | 100 | 32 | 1mo ago |
| Aegis 2.0 | NemotronReasoning | F1 Score | 87.3 | 32 | 1mo ago |
| Aegis | NemotronReasoning | F1 Score | 89.6 | 32 | 1mo ago |
| SEA-SafeguardBench | SEA-Guard-12B | AUPRC (Average) | 93.6 | 29 | 1mo ago |
| EXPGUARD (test) | ShieldGemma | Financial Performance Score | 0 | 28 | 1mo ago |
| SEA-SafeguardBench English | SEA-Guard-12B | AUPRC | 98.9 | 18 | 1mo ago |
| XSTest | WildGuard | F1 Score | 94.8 | 16 | 1mo ago |
| WildG | Qwen3Guard-Gen | F1 Score | 88.5 | 16 | 1mo ago |
| ToxiC | Qwen3Guard-Gen | F1 Score | 81.9 | 16 | 1mo ago |
| SorryB | PolyGuard | F1 Score | 97.2 | 16 | 1mo ago |
| SEval | YuFeng-XGuard | F1 Score | 92.5 | 16 | 1mo ago |
| OverR | YuFeng-XGuard | F1 Score | 41.9 | 16 | 1mo ago |
| OpenAIM | Qwen3Guard-Gen | F1 | 81.1 | 16 | 1mo ago |
| XSTest Text Prompt | GuardReasoner-8B | F1 Score | 93.71 | 14 | 1mo ago |
| WildGuard Text Prompt | GuardReasonerVL-7B | F1 Score | 90.46 | 14 | 1mo ago |
| ToxicChat Text Prompt | GPT4o-mini | F1 Score | 96.27 | 14 | 1mo ago |
| Simple SafetyTest Text Prompt | Gemini2.5-Flash | F1 Score | 100 | 14 | 1mo ago |
| OpenAI Moderation Text Prompt | LlamaGuard2-8B | F1 Score | 88.89 | 14 | 1mo ago |
| Aegis Text Prompt 2.0 | GPT4o-mini | F1 Score | 83.52 | 14 | 1mo ago |
| HarmBench Text Prompt | GPT-OSS-SafeGuard-20B | F1 Score | 98.85 | 14 | 1mo ago |
| SEALS (SEA) | SEA-Guard-12B | AUPRC | 96.9 | 9 | 1mo ago |
| LlavaGuard Image Prompt | GPT4o-mini | F1 Score | 0.752 | 7 | 1mo ago |
| BeaverTails-V Text-Image Prompt | ProGuard-7B | F1 Score | 88.36 | 7 | 1mo ago |
Showing 23 of 23 rows