Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA Coding Agent benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
Coding Agent
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Aggregated (RebenchT, CodeCI, Bird)
TDScaling
Overall Average Score
34.99
5
4d ago
Bird
TDScaling
Pass@1
43.83
5
4d ago
CodeCI
TDScaling
Avg@2
39.43
5
4d ago
RebenchT
TDScaling
OH-p@1
33.13
5
4d ago
Showing 4 of 4 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task