LLM Throughput on AIME (Mathematical Reasoning)

Top throughput: 998 token/s (FlashEvolve)

Updated 2mo ago

Evaluation Results

Method	Links	Throughput (token/s)	
FlashEvolve 2026.05		998	11.4
Combee 2026.05		994	6.2
Combee 2026.05		977	1.6
FlashEvolve 2026.05		485	6.6
Combee 2026.05		336	0.6
Combee 2026.05		211	1
GEPA 2026.05		200	2.2
GEPA 2026.05		103	1.3