Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safety Classification Disagreement on WildChat Intent

0.4Disagreement Rate (per 1k conv)

Gemini 3.1

-1.29.620.431.2May 22, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
0.4
2026.05
0.4
2026.05
0.4
2026.05
0.6
2026.05
0.7
2026.05
0.9
2026.05
1
2026.05
1.4
2026.05
1.5
2026.05
1.6
2026.05
1.8
2026.05
1.9
2026.05
2.2
2026.05
2.7
2026.05
2.8
2026.05
2.8
2026.05
2.9
2026.05
5.1
2026.05
5.6
2026.05
5.9
2026.05
6.2
2026.05
7.2
2026.05
10.1
2026.05
11.5
2026.05
11.5
2026.05
15.3
2026.05
19.2
2026.05
22.9
2026.05
24.1
2026.05
40.4