Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Meta-Evaluation on Aggregated Benchmarks (AIME, ARC, GSM8K, HE, MMLU, IT, RU, BFCL)

2.4Overall Average Rank

GPT-5 nano

1.8885.3448.812.256May 8, 2026
Updated 23d ago

Evaluation Results

MethodLinks
2026.05
2.4
2026.05
2.6
2026.05
4.2
2026.05
4.5
2026.05
5
2026.05
5.1
2026.05
8
2026.05
8.2
2026.05
9.1
2026.05
9.9
2026.05
10.4
2026.05
11
2026.05
12.4
2026.05
13.4
2026.05
14.5
2026.05
15.2