Question Answering on SuperGPQA Law
Best Accuracy: 43.8 (Llama3.1-70B)
[Chart: Accuracy by date; Jan 20, 2026]
Evaluation Results
Method           Setting                  Date     Accuracy
Llama3.1-70B     Post-Training Stage=DPO  2026.01  43.8
Qwen3-30B        Post-Training Stage=DPO  2026.01  42.5
Qwen3-30B        Post-Training Stage=OOB  2026.01  42.4
Qwen3-30B        Post-Training Stage=SFT  2026.01  42.1
Llama3.1-70B     Post-Training Stage=SFT  2026.01  41.7
Llama3.1-70B     Post-Training Stage=OOB  2026.01  40.3
Nemotron1.5-49B  Post-Training Stage=DPO  2026.01  38
Nemotron1.5-49B  Post-Training Stage=OOB  2026.01  37.7
Nemotron1.5-49B  Post-Training Stage=SFT  2026.01  37.6
SaulLM           Parameters=141B          2026.01  28.4