SOTA Unsafe Prompt Detection benchmarks and papers with code | Wizwand
Unsafe Prompt Detection
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
|---|---|---|---|---|---|
| ToxicChat (test) | OpenAI Moderation API | Precision | 0.815 | 16 | 1mo ago |
| XSTest (test) | OpenAI Moderation API | Precision | 87.8 | 7 | 1mo ago |
| XSTest | GradSafe-Zero | AUPRC | 93.6 | 4 | 1mo ago |
| ToxicChat | GradSafe-Zero | AUPRC | 75.5 | 4 | 1mo ago |