Language Understanding on MMLU-Pro (test) by Model Size

23.6 MMLU-Pro (test) Accuracy

Model Swarm

[Chart: MMLU-Pro (test) accuracy; axis ticks 20.584, 21.367, 22.15, 22.933; date May 28, 2026]
Updated 5d ago

Evaluation Results

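| Date | Method | MMLU-Pro (test) Accuracy | | | | | | |
|---|---|---|---|---|---|---|---|---|
| 2026.05 | | 23.6 | - | - | - | - | - | - |
| 2026.05 | | 23.4 | - | - | - | - | - | - |
| 2026.05 | | 23.3 | - | - | - | - | - | - |
| 2026.05 | | 23.2 | - | - | - | - | - | - |
| 2026.05 | | 23.2 | - | - | - | - | - | - |
| 2026.05 | | 22.5 | - | - | - | - | - | - |
| 2026.05 | | 22.4 | - | - | - | - | - | - |
| 2026.05 | | 22.1 | - | - | - | - | - | - |
| 2026.05 | | 21.7 | - | - | - | - | - | - |
| 2026.05 | | 20.7 | - | - | - | - | - | - |
| 2026.05 | | 20.7 | - | - | - | - | - | - |
| 2026.05 | | - | 27.5 | 44.4 | 61 | 65.3 | 68.6 | 53.4 |
| 2026.05 | | - | 24.4 | 36.8 | 51.6 | 55.2 | 60.4 | 45.7 |
| 2026.05 | | - | 24.9 | 40 | 55.4 | 60.4 | 65.3 | 49.2 |
| 2026.05 | | - | 24.7 | 38.5 | 54.7 | 59.6 | 66.5 | 48.8 |
| 2026.05 | | - | 26.2 | 41.1 | 57.3 | 62.4 | 65.6 | 50.5 |
| 2026.05 | | - | 26.5 | 41.8 | 58.1 | 62.9 | 66.4 | 51.1 |
| 2026.05 | | - | 26.8 | 41.7 | 58.2 | 61.9 | 66.6 | 51 |
| 2026.05 | | - | 27.1 | 43.5 | 59.4 | 63.7 | 67.8 | 52.3 |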