Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Coding on MBPP
Loading...
94.16
Score
Ministral-3-R
-3.01864
22.21043
47.4395
72.66857
Jul 15, 2024
Oct 19, 2024
Jan 23, 2025
Apr 29, 2025
Aug 3, 2025
Nov 7, 2025
Feb 12, 2026
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Ministral-3-R
# Param.=8B
2026.02
94.16
Nemotron-Nano-v2
# Param.=9B
2026.02
93.39
MiniCPM-4.1
# Param.=8B
2026.02
91.05
Falcon-H1R
# Param.=7B
2026.02
91.05
MiniCPM-SALA
# Param.=9B
2026.02
89.11
Qwen3
# Param.=8B
2026.02
81.32
Llama-3-70B-Instruct
Type=Instruction-tuned
2024.07
0.823
Qwen2-72B-Instruct
Type=Instruction-tuned
2024.07
0.802
Qwen1.5-110B-Chat
Type=Instruction-tuned
2024.07
0.764
Mixtral-8x22B-Instruct
Type=Instruction-tuned
2024.07
0.759
Qwen1.5-72B-Chat
Type=Instruction-tuned
2024.07
0.719
Feedback
Search any
task
Search any
task