Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Humanity's Last Exam

Benchmarks

Task NameDataset NameSOTA ResultTrend
ReasoningHumanity's Last Exam
Accuracy84.61
46
Question AnsweringHumanity's Last Exam
Pass@151.7
16
Expert-Level Question AnsweringHumanity's Last Exam
Accuracy40.9
14
Question AnsweringHumanity's Last Exam (HLE) MCQ
Accuracy19.9
6
Long Context EvaluationHumanity's Last Exam AA-LCR
Accuracy54.3
6
World KnowledgeHUMANITY’S LAST EXAM text-only
Score11.1
4
Showing 6 of 6 rows