Code Optimization on HumanEval-Hard Cross-Domain

13.6 Calls per Task

TextBFGS

Updated 5mo ago

Evaluation Results

Method	Links	Calls per Task
TextBFGS 2026.01		13.6	1,581.7	21.6
TextBFGS-REMO 2026.01		13.9	1,594.4	22.2
TextBFGS 2026.01		17	1,481.3	25.2
TextGrad-Momentum 2026.01		29.8	1,464.2	43.7
TextGrad 2026.01		35.8	863.9	30.9