Code Generation on RepoBench-P Python XF-First

52.4 Exact Match (EM)

Ours

[Chart: Exact Match (EM) over time. Y-axis ticks: 37.632, 41.466, 45.3, 49.134. X-axis tick: May 18, 2026.]
Updated 15d ago

Evaluation Results

Method  Links

Date     EM
2026.05  52.4  73.81183.8
2026.05  51.8  73.42455.6
2026.05  51.2  73.12685.9
2026.05  49.8  71.52856.2
2026.05  48.3  70.23126.8
2026.05  38.2  62.4451.8