Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Safety classification disagreement benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Safety classification disagreement
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
WildChat Content
Gemini 3.1
Disagreement Rate (per 1k Conversations)
0.4
30
8d ago
WildChat Intent
Gemini 3.1
Disagreement Rate (per 1k conv)
0.4
30
8d ago
Showing 2 of 2 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task