Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Function-level Code Generation on MBPP+ augmented (test)

79.6Pass@1

LLaDA2.0-flash

39.76850.10960.4570.791Oct 31, 2024Jan 21, 2025Apr 14, 2025Jul 6, 2025Sep 27, 2025Dec 19, 2025Mar 12, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
79.6
79.4
2026.01
77.1
2026.01
76
2026.01
75.1
2026.01
75.1
2026.01
74.1
2026.01
72.8
2026.01
72.8
2026.01
71.7
2026.01
71.2
2024.10
70.7
2026.01
70.4
2026.01
70.1
2026.01
69
2026.01
69
2024.10
67.7
2024.10
67.2
2026.01
67.2
2026.01
67.2
2024.10
66.9
2024.10
66.7
2024.10
66.4
2026.01
65.6
2024.10
65.4
2024.10
65.2
2026.01
65.1
2026.01
64.6
2024.10
64.3
2024.10
63.2
2024.10
62.9
2026.01
62.2
2026.01
61.9
2024.10
61.7
2026.03
60.58
2024.10
60.2
2026.03
59.52
2026.01
59
2026.03
58.99
2024.10
58.6
2026.03
58.47
2026.03
58.47
2026.03
58.2
2024.10
57.9
2026.03
57.67
2026.03
57.67
2026.03
57.14
2026.01
57.1
2024.10
56.6
2026.03
54.76
2026.03
54.76
2024.10
53.1
2024.10
49.7
2024.10
49.1
2026.01
44.4
2026.01
41.3