Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Tool Usage on BigCodeBench (Pass@1)
Loading...
41.67
Pass@1
Qwen2.5-7B-Ins + ADR
38.1652
39.0751
39.985
40.8949
May 29, 2026
Pass@1
Updated 2d ago
Evaluation Results
Method
Method
Links
Pass@1
Qwen2.5-7B-Ins + ADR
Base Model=Qwen2.5-7B-...
2026.05
41.67
Qwen2.5-7B-Ins + KodCode
Base Model=Qwen2.5-7B-...
2026.05
41.27
Qwen2.5-7B-Ins
Base Model=Qwen2.5-7B-...
2026.05
38.3
Feedback
Search any
task
Search any
task