Code Generation on LeetCodeHard
Top result: Reflexion, Accuracy 38.2
[Chart: Accuracy and Cost per Task ($); data point dated Dec 3, 2025]
Evaluation Results
Method                  Cost setting  Date     Accuracy  Cost per Task ($)
Reflexion               High          2025.12  38.2      0.736
Reflexion + BeFS        High          2025.12  38.1      0.512
Reflexion + best-of-N   High          2025.12  37.6      0.508
Reflexion + BeFS        Low           2025.12  36.1      0.168
Reflexion + BeFS        Medium        2025.12  36.1      0.289
Reflexion               Medium        2025.12  35.9      0.449
Reflexion               Low           2025.12  35.5      0.279
Reflexion + best-of-N   Low           2025.12  35.5      0.279