Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Refusal Detection on XSTEST-RESP (full)

98.1RR (F1)

GPT-4

42.04456.59771.1585.703Jun 26, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.06
98.1
2024.06
92.8
2024.06
74.3
71
64.1
62.9
58.6
2024.06
48.5
2024.06
44.2