Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Generation on MBPP (Pass@1 Python, Pass@1 Rust)

96.5Pass@1 Accuracy (Python)

SYMPHONY-L

67.17274.78682.490.014Jan 30, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
96.597.4
2026.01
92.794.6
2026.01
91.8-
2026.01
91-
2026.01
89-
2026.01
87.7-
2026.01
81.1-
2026.01
8071
2026.01
77.175.4
2026.01
71.4-
2026.01
71-
2026.01
68.3-