Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multistep Reasoning on MUSR-fr
Loading...
33.79
Average Score
Gamayun
28.5276
29.8938
31.26
32.6262
Dec 25, 2025
Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Score
Gamayun
Zero-shot=true, Chat T...
2025.12
33.79
EuroLM
Zero-shot=true, Chat T...
2025.12
33.13
Llama3.2
Zero-shot=true, Chat T...
2025.12
31.94
Qwen2.5
Zero-shot=true, Chat T...
2025.12
30.99
Qwen3
Zero-shot=true, Chat T...
2025.12
28.86
Gemma3
Zero-shot=true, Chat T...
2025.12
28.73
Feedback
Search any
task
Search any
task