
Response Harmfulness Classification on WildGuard (test)

79.48 F1 (Total)

GuardReasoner-VL 7B

[Chart: scores over time, Jun 26, 2024 to Feb 3, 2026]
Updated 5d ago

Evaluation Results

Date      F1 (Total)  Col 2   Col 3   Col 4
2026.02   79.48       -       -       -
2026.02   78.83       -       -       -
2026.02   78.2        -       -       -
2026.02   78.06       -       -       -
2026.02   77.89       -       -       -
2026.02   77.57       -       -       -
2024.06   77.3        73.6    81.3    -
2024.06   76.8        67.7    85      -
2026.02   76.8        -       -       -
2026.02   76.4        -       -       -
2026.02   76.12       -       -       -
2024.06   75.4        68.4    81.5    -
2026.02   75.4        -       -       -
2026.02   70.8        -       -       -
-         66.5        47.9    78.2    -
2026.02   66.39       -       -       -
2024.06   63.4        51.2    74.3    -
2026.02   63.4        -       -       -
2024.06   63.2        61.7    64.2    -
2024.06   60.1        51.8    70.3    -
2026.02   60.1        -       -       -
2024.06   56.4        49      62.4    -
2026.02   56.4        -       -       -
-         50.5        25.8    66.7    -
2024.06   49.1        40.4    57.6    -
2026.02   49.1        -       -       -
2026.02   47          -       -       -
2024.06   45.7        41.9    50.2    -
2026.02   45.7        -       -       -
-         16.9        14.7    18.8    -
2026.05   -           -       -       65.24
2026.05   -           -       -       71.43
2026.05   -           -       -       50
2026.05   -           -       -       50.5
2026.05   -           -       -       66.5
2026.05   -           -       -       70.8
2026.05   -           -       -       49.1
2026.05   -           -       -       56.4
2026.05   -           -       -       77.5
2026.05   -           -       -       20.13
2026.05   -           -       -       47
2026.05   -           -       -       45.7
2026.05   -           -       -       60.1
2026.05   -           -       -       76.8
2026.05   -           -       -       63.4
2026.05   -           -       -       75.4
2026.05   -           -       -       17.56
2026.05   -           -       -       74.81
2026.05   -           -       -       79.7
2026.05   -           -       -       78.2
2026.05   -           -       -       77.23
2026.05   -           -       -       81.23