Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Overrefusal evaluation on IH-Challenge overrefusal

1Performance Score

GPT-5-Mini-R

0.78160.83830.8950.9517Mar 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
1
2026.03
0.79