Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Question Answering on MMLU Professional Law

87.7Accuracy

GPT-4

17.81235.95654.172.244Dec 23, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
87.7-84.32
2025.12
65.78-74.22
2025.12
56.66-57.94
2025.12
51.96-55.1
2025.12
44.98-42.32
2025.12
34.22-33.97
2025.12
33.75-33.82
2025.12
33.59-32.66
2025.12
31.22-31.14
2025.12
30.81-31.64
2025.12
28.11-33.1
2025.12
27.7-27.87
2025.12
27.7-26.77
2025.12
27.6-24.8
2025.12
26.4-24.9
2025.12
25.68-24.16
2025.12
25.65-27.98
2025.12
25.55-24.75
2025.12
25.5-27.26
2025.12
25.1-26.89
2025.12
24.92-24.74
2025.12
24.9-23.88
2025.12
24.6-26.42
2025.12
24.3-24.2
2025.12
23.2-20.77
2025.12
20.5-21.1