Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Generation on BigCodeBench Lite-Pro Compositional Stream

66.7Accuracy

MEMPROBE

37.0644.75552.4560.145Jun 1, 2026
Updated 1d ago

Evaluation Results

MethodLinks
2026.06
66.721.9-
2026.06
64.6-2.1
2026.06
62.517.7-
2026.06
62.5-0
2026.06
60.4--
2026.06
60.4--
2026.06
59--
2026.06
59--
2026.06
58.313.5-
2026.06
56.3-2
2026.06
50.7-2.1
2026.06
48.63.8-
2026.06
45.8-1.4
2026.06
44.8--
2026.06
44.8--
2026.06
44.40.4-
2026.06
44.4-0.6
2026.06
43.81-
2026.06
39.65.2-
2026.06
38.2-1.4