Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering on TriviaQA (EM, Tokens(k))
Loading...
81.43
EM
GPT-4o-mini
17.3036
33.9518
50.6
67.2482
Jan 29, 2026
EM
Tokens (k)
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
Tokens (k)
GPT-4o-mini
zero-shot-CoT=true, pa...
2026.01
81.43
0.69
PIR
zero-shot-CoT=true, pa...
2026.01
45.51
0.68
PIR
zero-shot-CoT=true, pa...
2026.01
25.56
0.64
PIR
zero-shot-CoT=true, pa...
2026.01
20.12
0.9
Reasoning Base
zero-shot-CoT=true, pa...
2026.01
19.77
1.29
Feedback
Search any
task
Search any
task