Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Coding Agent on RebenchT
Loading...
33.13
OH-p@1
TDScaling
20.9204
24.0902
27.26
30.4298
Feb 3, 2026
OH-p@1
Qod-p@1
Updated 4d ago
Evaluation Results
Method
Method
Links
OH-p@1
Qod-p@1
TDScaling
Category=Proposed Impl...
2026.02
33.13
23.56
Qwen3-Coder-30B-A3B-Instruct
Category=Baseline
2026.02
31.21
15.84
TOUCAN
Category=Tool-Learning...
2026.02
28.75
19.94
APIGen-MT
Category=Tool-Learning...
2026.02
27.66
17.22
Simia
Category=Tool-Learning...
2026.02
21.39
7.83
Feedback
Search any
task
Search any
task