Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
English Reading on TriviaQA
Loading...
83.7
EM
Llama-3.3-70B-Instruct
40.748
51.899
63.05
74.201
Apr 30, 2026
EM
Updated 1mo ago
Evaluation Results
Method
Method
Links
EM
Llama-3.3-70B-Instruct
Shots=5
2026.04
83.7
Qwen3-14B
Shots=5
2026.04
66.5
Llama-3.1-8B-Instruct
Shots=5
2026.04
62.4
Llama-Primus-Reasoning-8B
Shots=5
2026.04
62.15
Qwen3.5-9B
Shots=5
2026.04
59.45
Qwen3-8B
Shots=5
2026.04
57.9
XekRung-8B
Shots=5
2026.04
54.7
Foundation-Sec-8B-Reasoning
Shots=5
2026.04
52.75
SecGPT-14B
Shots=5
2026.04
42.4
Feedback
Search any
task
Search any
task