
Agent Harm Evaluation on AgentHarm public

HarmScore: 9.6

gpt-4o + GBT-SE

7.11223.90640.757.494
Jan 30, 2026
Updated 1mo ago

Evaluation Results

Method    Links
2026.01    9.6
2026.01    10.3
2026.01    11.2
2026.01    12.4
2026.01    16.5
2026.01    19
2026.01    29.4
2026.01    71.8