Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Coding Agent on Aggregated (RebenchT, CodeCI, Bird)
Loading...
34.99
Overall Average Score
TDScaling
22.3228
25.6114
28.9
32.1886
Feb 3, 2026
Overall Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Overall Average Score
TDScaling
Category=Proposed Impl...
2026.02
34.99
Qwen3-Coder-30B-A3B-Instruct
Category=Baseline
2026.02
30.99
TOUCAN
Category=Tool-Learning...
2026.02
29.82
APIGen-MT
Category=Tool-Learning...
2026.02
27.48
Simia
Category=Tool-Learning...
2026.02
22.81
Feedback
Search any
task
Search any
task