Cybersecurity Knowledge Evaluation on MMLU Security

93.2 Accuracy

GPT-5

[Chart: accuracy (y-axis) by date (x-axis); Jan 28, 2026]
Updated 4d ago

Evaluation Results

Method    Date       Accuracy   (unlabeled)   Links
          2026.01    93.2       0.007
          2026.01    88.4       0.014
          2026.01    88         0.019
          2026.01    87.2       0.007
          2026.01    87         0.009
          2026.01    87         0.011
          2026.01    86.4       0.017
          2026.01    85.8       0.017
          2026.01    84.6       0.014
          2026.01    84.4       0.01
          2026.01    83.6       0.016
          2026.01    83.4       0.027
          2026.01    78.2       0.023
          2026.01    77         0.026
          2026.01    76.8       0.01
          2026.01    76.2       0.029
          2026.01    66.6       0.022