Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Over-Refusal Evaluation on Benign prompt dataset
Loading...
17
Over-Refusal Rate
Base
13.68
36.09
58.5
80.91
Apr 17, 2026
Over-Refusal Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Over-Refusal Rate
Base
Model=LLaVA-v1.6-Mistr...
2026.04
17
Beam
Model=LLaVA-v1.6-Mistr...
2026.04
26.3
Safety
Model=LLaVA-v1.6-Mistr...
2026.04
66.3
CB
Model=LLaVA-v1.6-Mistr...
2026.04
100
Feedback
Search any
task
Search any
task