Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Code Generation on HumanEval Hard
Loading...
97.78
Base Pass Rate
TextBFGS
90.8432
92.6441
94.445
96.2459
Jan 20, 2026
Base Pass Rate
Plus Pass Rate
Updated 3mo ago
Evaluation Results
Method
Method
Links
Base Pass Rate
Plus Pass Rate
TextBFGS
Knowledge Base (KB) So...
2026.01
97.78
93.33
TextBFGS-REMO
Knowledge Base (KB) So...
2026.01
95.56
91.11
TextGrad
Knowledge Base (KB) So...
2026.01
91.11
82.22
TextGrad-Momentum
Knowledge Base (KB) So...
2026.01
91.11
86.67
TextBFGS (w/o KB)
Knowledge Base (KB) So...
2026.01
91.11
82.22
Feedback
Search any
task
Search any
task