Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Scientific Reasoning on SuperGPQA (avg@3)
Loading...
56.8
avg@3
Qwen3-30A3-2507
38.392
43.171
47.95
52.729
Dec 6, 2025
avg@3
Updated 4d ago
Evaluation Results
Method
Method
Links
avg@3
Qwen3-30A3-2507
Release Identifier=2507
2025.12
56.8
Qwen3-32B-2504
Release Identifier=2504
2025.12
54.1
Nanbeige4-3B-Thinking
Release Identifier=2511
2025.12
53.2
Qwen3-14B-2504
Release Identifier=2504
2025.12
46.8
Qwen3-4B-2507
Release Identifier=2507
2025.12
46.7
Qwen3-8B-2504
Release Identifier=2504
2025.12
39.1
Feedback
Search any
task
Search any
task