Robustness Evaluation on LLMBar
[Chart: Accuracy over time. Best result: Qwen3-30B-A3B-Thinking-2507, 83.07 (Jan 7, 2026).]
Evaluation Results
| Method | Variant | Date | Accuracy |
|---|---|---|---|
| Qwen3-30B-A3B-Thinking-2507 | Thinking | 2026.01 | 83.07 |
| QwQ-32B | | 2026.01 | 79.31 |
| DeepSeek-R1 | | 2026.01 | 79 |
| Qwen3-Next-80B-A3B-Thinking | Thinking | 2026.01 | 77.55 |
| DeepSeek-V3 | | 2026.01 | 76.49 |
| Qwen2.5-32B-Instruct | Instruct | 2026.01 | 67.71 |
| Qwen3-Next-80B-A3B-Instruct | Instruct | 2026.01 | 64.55 |
| Qwen3-30B-A3B-Instruct-2507 | Instruct | 2026.01 | 59.25 |