Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Overrefusal Evaluation on OrBench-H

99.85RR

Db as Alpaca

1.133226.761652.3978.0184Oct 15, 2025Nov 8, 2025Dec 3, 2025Dec 28, 2025Jan 21, 2026Feb 15, 2026Mar 12, 2026
Updated 20d ago

Evaluation Results

MethodLinks
2026.03
99.85
2026.03
98.48
2025.10
89.3
2026.03
84.61
2026.03
77.1
2025.10
72.6
2026.03
60.58
2026.03
57.92
2026.03
57.09
2026.03
55.34
2025.10
37.5
2025.10
35.9
2025.10
27.7
2025.10
25.2
2025.10
24.2
2026.03
23.88
2025.10
23.4
2026.03
16.83
2026.03
12.89
2026.03
5.84
2026.03
4.93