Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
English Knowledge on GPQA
Loading...
51.52
Accuracy
Llama-3.3-70B-Instruct
11.5944
21.9597
32.325
42.6903
Apr 30, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Llama-3.3-70B-Instruct
Shots=5
2026.04
51.52
Qwen3-14B
Shots=5
2026.04
48.48
Qwen3.5-9B
Shots=5
2026.04
45.96
XekRung-8B
Shots=5
2026.04
43.43
Qwen3-8B
Shots=5
2026.04
41.92
Foundation-Sec-8B-Reasoning
Shots=5
2026.04
31.7
Llama-3.1-8B-Instruct
Shots=5
2026.04
30.4
SecGPT-14B
Shots=5
2026.04
13.13
Feedback
Search any
task
Search any
task