Mathematical Reasoning on ACP
[Chart: Accuracy (y-axis) by date (x-axis). Best result: Qwen2.5-3B, 36.71 accuracy, Oct 9, 2025.]
Evaluation Results
| Method | Evaluation Protocol | Date | Accuracy |
|---|---|---|---|
| Qwen2.5-3B | Si... | 2025.10 | 36.71 |
| Falcon3-7B | Si... | 2025.10 | 36.29 |
| K-LVR (1-Bytes) | En... | 2025.10 | 35.43 |
| K-LVR (MCV) | En... | 2025.10 | 34.71 |
| Union | En... | 2025.10 | 25.86 |
| Naive (MCV) | En... | 2025.10 | 25.71 |