Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Tool Use on LitQA 2
Loading...
55.6
Accuracy
Olmo 3.1 32B Instruct
25.024
32.962
40.9
48.838
Dec 15, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Olmo 3.1 32B Instruct
Stage=Final Instruct 3.1
2025.12
55.6
Olmo 3.1 32B Instruct
Stage=DPO
2025.12
53.3
Olmo 3.1 32B Instruct
Stage=SFT
2025.12
47.6
Qwen 3 32B
Thinking=No, Parameter...
2025.12
46.7
Olmo 3 7B Instruct
stage=DPO
2025.12
43.3
Qwen 3 8B
stage=Instruct
2025.12
39.6
Olmo 3 7B Instruct
stage=Final Instruct
2025.12
38.2
Olmo 3 7B Instruct
stage=SFT
2025.12
38
Qwen 3 VL 32B Instruct
Parameters=32B
2025.12
32
Qwen 3 VL 8B Inst
stage=Instruct
2025.12
30.7
Qwen 2.5 7B
stage=Instruct
2025.12
29.8
Qwen 2.5 32B
Parameters=32B
2025.12
26.2
Feedback
Search any
task
Search any
task