
RepoBench

Benchmarks

Task Name             Dataset Name       SOTA Result              Trend
Long Code Completion  RepoBench >8k      Edit Sim 51.24           12
Long Code Completion  RepoBench 4k-8k    Edit Similarity 53.3     12
Long Code Completion  RepoBench 0-4k     Edit Similarity 52.82    12
Code Completion       RepoBench-P        Similarity 0.7305        10
Coding                RepoBench          Pass@1 25.3              6
code generation       RepoBench P        Score 15.04              5