Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General Knowledge on SuperGPQA
Loading...
36.21
pass@1
Qwen3-8B
16.4604
21.5877
26.715
31.8423
Jan 8, 2026
pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
pass@1
Qwen3-8B
Backbone=Qwen3-8B
2026.01
36.21
RelayLLM (Difficulty-Aware)
Student Model Backbone...
2026.01
29.93
RelayLLM (Simple)
Student Model Backbone...
2026.01
29.85
CITER
Student Model Backbone...
2026.01
28.25
GRPO
Student Model Backbone...
2026.01
26.01
Base Model
Student Model Backbone...
2026.01
24.46
RelayLLM (Simple)
Student Model Backbone...
2026.01
21.35
RelayLLM (Difficulty-Aware)
Student Model Backbone...
2026.01
20.88
CITER
Student Model Backbone...
2026.01
20.34
GRPO
Student Model Backbone...
2026.01
19.91
Base Model
Student Model Backbone...
2026.01
17.22
Feedback
Search any
task
Search any
task