Share your thoughts, 1 month free Claude Pro on usSee more

Code Generation on HumanEval (Performance %)

71.9Performance (%)

ATOM

Updated 1mo ago

Evaluation Results

Method	Links
ATOM 2026.05		71.9
LLM-Debate 2026.05		71.07
Complete 2026.05		70.25
Random 2026.05		69.42
Star 2026.05		68.6
ARG-Designer 2026.05		68.6
CoT 2026.05		67.77
Chain 2026.05		66.12
G-Designer 2026.05		66.12
Vanilla 2026.05		65.29
AgentPrune 2026.05		62.81
AgentDropout 2026.05		62.81