Math Reasoning on MATH (test) (Accuracy)
[Chart: Accuracy. Highlighted point: MENTORCOLLAB MLP, 46.8, Feb 5, 2026. Updated 4d ago.]
Evaluation Results
Method              Configuration              Date     Accuracy
MENTORCOLLAB MLP    Generator=Qwen3-8B-Bas...  2026.02  46.8
Co-LLM              Generator=Qwen3-8B-Bas...  2026.02  25
MENTORCOLLAB MLP    Generator=Gemma-3-4B-P...  2026.02  21
MENTORCOLLAB MLP    Generator=Llama3.1-8B,...  2026.02  18
MENTORCOLLAB FREE   Generator=Gemma-3-4B-P...  2026.02  15.8
Generator Baseline  Model=Gemma-3-4B-PT, D...  2026.02  14.2
Nudging             Generator=Qwen3-1.7B,...   2026.02  13.6