Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Generation on LiveCodeBench v6 (2025-02 to 2025-05)

74.1Accuracy

Qwen3-235B-A22B-Thinking-2507

31.66842.68453.764.716May 6, 2026May 7, 2026May 8, 2026May 9, 2026May 10, 2026May 11, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
74.1---
2026.05
72.610082.352.5
72.5---
71.1---
2026.05
69.498.778.547.1
2026.05
69.2---
2026.05
68.7---
2026.05
68.710073.747.5
2026.05
67.8---
67.498.473.148
2026.05
66.8---
2026.05
65.298.275.243.5
2026.05
65---
2026.05
64.989.574.446.3
64.997.671.244.3
2026.05
64.8---
2026.05
64.6---
6397.672.439.3
60.396.865.438.5
2026.05
59.498.46834
5999.26634
2026.05
58.3---
2026.05
57.9---
57.698.464.732.4
2026.05
5597.660.329.9
53.897.656.429.9
2026.05
47.188.249.624.6
4594.455.813.1
2026.05
43.182.949.219
40.589.54115.2
2026.05
33.3---