Coding on HumanEval (mean score)

0.9695 HumanEval Mean Score

Ministral-3-R

[Chart: HumanEval mean score over time, Jan 21, 2026 to Feb 12, 2026]
Updated 4d ago

Evaluation Results

Date       Score
2026.02    0.9695
2026.02    0.9634
2026.02    0.9512
2026.01    0.945
2026.02    0.939
2026.02    0.939
2026.01    0.933
2026.02    0.9146
2026.01    0.823
2026.01    0.823
           0.817
2026.01    0.799
2026.01    0.793
2026.01    0.75
2026.01    0.72
2026.01    0.689
2026.01    0.671
2026.01    0.64
2026.01    0.634
2026.01    0.616
2026.01    0.616
2026.01    0.61
2026.01    0.61
2026.01    0.567
2026.01    0.506
2026.01    0.384
2026.01    0.378
2026.01    0.03