Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Response Harmfulness Detection on XSTEST-RESP

95.48Response Harmfulness F1

GuardReasoner-Omni 4B

34.556850.373466.1982.0066Jun 26, 2024Oct 21, 2024Feb 15, 2025Jun 12, 2025Oct 7, 2025Feb 1, 2026May 29, 2026
Updated 2d ago

Evaluation Results

MethodLinks
2026.02
95.48
2024.06
95.4
2026.05
95.33
2024.06
94.7
2026.02
94.7
2026.05
94.7
2026.05
94.7
2026.02
94.34
2026.05
94.34
2026.05
94.34
2026.05
94.19
2026.02
93.67
2024.06
93.4
2026.02
92.9
2026.02
92.72
2026.05
92.54
2026.02
92.5
2026.02
92.12
2026.05
92.02
2026.05
91.9
2026.05
91.36
2026.05
91.36
2024.06
91.3
90.8
2026.05
90.8
2026.05
90.8
2026.05
90.66
2026.02
90.4
2026.05
90.4
2026.05
90.4
2026.05
90.12
2026.02
89.8
2026.02
87.67
2026.05
87.67
2026.05
87.67
2026.05
86.9
2026.05
86.62
2026.05
86.55
2026.05
86.2
2026.05
85.88
85
2026.02
83.6
2026.05
83.6
82
2026.05
82
81.2
2024.06
80.5
2026.05
75.65
2026.05
74.75
2026.02
73.86
2026.05
73.86
2026.05
73.86
2026.05
73.36
2026.02
72
2026.05
72
70.5
70.2
2026.05
65.55
2026.05
65.55
2026.05
65.12
2026.02
64.57
2026.02
64.5
2026.05
64.5
2024.06
60.4
2026.02
60.4
2026.05
60.4
2026.05
60.4
2024.06
52.8
2026.02
52.8
2026.05
52.8
2026.05
52.8
2024.06
49
46.6
2026.05
46.6
2026.05
45.95
2024.06
36.9