Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Refusal Detection on XSTEST-RESP (full)
Loading...
98.1
RR (F1)
GPT-4
42.044
56.597
71.15
85.703
Jun 26, 2024
RR (F1)
Updated 4d ago
Evaluation Results
Method
Method
Links
RR (F1)
GPT-4
Specific Model=gpt-4-0...
2024.06
98.1
WILDGUARD
2024.06
92.8
LibrAI-LongFormer-ref
Base Model=LongFormer
2024.06
74.3
Keyword-based detector
2024.06
71
Llama-Guard2
2024.06
64.1
Llama-Guard
2024.06
62.9
OAI Mod. API
2024.06
58.6
Aegis-Guard-P
Policy=Permissive
2024.06
48.5
Aegis-Guard-D
Policy=Defensive
2024.06
44.2
Feedback
Search any
task
Search any
task