Coding Agent on CodeCI
[Chart: Avg@2 by date. Best result: 39.43 (TDScaling). Date shown: Feb 3, 2026.]
Evaluation Results
Method                        Category          Date     Avg@2
TDScaling                     Proposed Impl...  2026.02  39.43
TOUCAN                        Tool-Learning...  2026.02  37.71
Qwen3-Coder-30B-A3B-Instruct  Baseline          2026.02  35.43
APIGen-MT                     Tool-Learning...  2026.02  30.86
Simia                         Tool-Learning...  2026.02  30.86