Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multitask Language Understanding on GlobalMMLU

48.65Accuracy

Qwen3

25.19831.286537.37543.4635Dec 25, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
48.65----------
2025.12
47----------
2025.12
40.58----------
2025.12
39.27----------
2025.12
36.04----------
2025.12
26.1----------
2026.01
-27.940.531.132.533.832.830.432.938.931.1
2026.01
-29.440.733.235.137.535.132.335.138.733.4