Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Code Generation on MBPP (Acc@t1, Acc@t2, Δ(t1,t2))

66.2Accuracy @ t1

Prompt based

15.03228.31641.654.884May 22, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.05
66.2--
2025.05
66.2714.8
2025.05
66.2736.8
2025.05
65.469.44
2025.05
65.469.23.8
2025.05
63--
2025.05
63685
2025.05
63685
2025.05
54.459.24.8
2025.05
54.459.65.2
2025.05
54.457.83.4
2025.05
54.456.82.4
2025.05
54.458.64.2
2025.05
54.459.45
2025.05
37--
2025.05
31.2--
2025.05
28.4--
2025.05
28.44415.6
2025.05
28.429.61.2
2025.05
28.442.414
2025.05
28.448.820.4
2025.05
28.447.419
2025.05
20.4--
2025.05
20.4232.6
2025.05
20.4243.6
2025.05
20.420.80.4
2025.05
20.423.22.8
2025.05
20.422.62.2
2025.05
17--