Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Code Generation on LiveCodeBench v6, HumanEval+, MBPP+, and SciCode

0.992Pass@1

DeepSeek-V3.1

0.297280.477640.6580.83836Jan 14, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.992
2026.01
0.989
2026.01
0.939
2026.01
0.93
2026.01
0.902
2026.01
0.872
2026.01
0.86
2026.01
0.86
2026.01
0.835
2026.01
0.76
2026.01
0.758
2026.01
0.731
2026.01
0.695
2026.01
0.662
2026.01
0.559
2026.01
0.391
2026.01
0.384
2026.01
0.324