Robustness Evaluation on LLMBar

Accuracy: 83.07

Qwen3-30B-A3B-Thinking-2507

[Chart: Accuracy by date; y-axis ticks 58.2972, 64.7286, 71.16, 77.5914; x-axis tick Jan 7, 2026]
Updated 4d ago

Evaluation Results

Date      Accuracy  Method  Links
2026.01   83.07
2026.01   79.31
          79
2026.01   77.55
          76.49
2026.01   67.71
2026.01   64.55
2026.01   59.25