
Refusal Detection on XSTEST-RESP (full)

98.1 RR (F1)

GPT-4

[Chart: RR (F1) over time; y-axis ticks 42.044, 56.597, 71.15, 85.703; x-axis tick Jun 26, 2024]
Updated 4d ago

Evaluation Results

Method  Links  Date     RR (F1)
               2024.06  98.1
               2024.06  92.8
               2024.06  74.3
                        71
                        64.1
                        62.9
                        58.6
               2024.06  48.5
               2024.06  44.2