Long-context Generation (Reasoning) on MMLU-Pro

13.67 TPT

SWIFT

Feb 23, 2026
Updated 3d ago

Evaluation Results

Date     TPT    Remaining values
2026.02  13.67  0.7453.6
2026.02  15.82  1.0887
2026.02  20.32  1.1293
2026.02  21.75  0.946.8
2026.02  22.75  1.1690.8
2026.02  23.15  1-
2026.02  23.2   0.870
2026.02  23.74  17.5
2026.02  23.9   1.0992
2026.02  24.06  1-
2026.02  24.61  0.8517.2
2026.02  24.97  1-
2026.02  25.68  1.1589.9
2026.02  27.91  0.8823.9
2026.02  30.93  1-
2026.02  34.62  1.285.1
2026.02  35.51  1.1688
2026.02  38.36  0.9966.1
2026.02  40.01  1.3391.2
2026.02  45.25  1.2990.2