Code Generation on HumanEval+ (SR, LLM%)

98.8 SR

LLM-only

Updated 2mo ago

Evaluation Results

Method	Links	SR	LLM%
LLM-only 2026.05		98.8	100
Oracle Router 2026.05		98.6	8.1
Heuristic Router 2026.05		97.4	26.6
R2V 2026.05		94.3	0.6
Entropy Router 2026.05		92.9	0.8
SLM-only 2026.05		91.9	0