Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safety Classification Disagreement on WildChat Content

0.4Disagreement Rate (per 1k Conversations)

Gemini 3.1

-2.23215.53433.351.066May 22, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
0.4
2026.05
0.7
2026.05
1
2026.05
1.1
2026.05
1.5
2026.05
1.5
2026.05
1.7
2026.05
1.9
2026.05
1.9
2026.05
2.2
2026.05
2.4
2026.05
2.5
2026.05
2.9
2026.05
4.1
2026.05
4.1
2026.05
4.6
2026.05
4.8
2026.05
5.1
2026.05
5.1
2026.05
6.8
2026.05
6.8
2026.05
8.1
2026.05
8.1
2026.05
9.7
2026.05
12.1
2026.05
20.5
2026.05
29.3
2026.05
42.2
2026.05
47.2
2026.05
66.2