Long-term state poisoning evaluation on OpenClaw: average across conversation variants

Harm Score (HS): 4.35

Grok-1

[Chart: y-axis ticks 4.2672, 4.8261, 5.385, 5.9439; x-axis date May 7, 2026]
Updated 23d ago

Evaluation Results

Method  Links
2026.05
4.35
2026.05
4.42
2026.05
5.53
6.42