Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multiple Choice Question Answering on BGB (test)
Loading...
78.8
Exact Accuracy
GPT-5 (min. reasoning)
63.408
67.404
71.4
75.396
Jan 20, 2026
Exact Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Exact Accuracy
GPT-5 (min. reasoning)
Train Data=Base Model
2026.01
78.8
GPT-5-mini (min. reasoning)
Train Data=Base Model
2026.01
75.8
Gemma 3 (12B)
Train Data=Difficulty-...
2026.01
75.1
LLaMA 3.1 (8B)
Train Data=Difficulty-...
2026.01
68.2
Gemma 3 (12B)
Train Data=Standard In...
2026.01
67.1
Gemma 3 (12B)
Train Data=Base Model
2026.01
67
LLaMA 3.1 (8B)
Train Data=Standard In...
2026.01
64.2
LLaMA 3.1 (8B)
Train Data=Base Model
2026.01
64
Feedback
Search any
task
Search any
task