Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA General Knowledge and Reasoning benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
General Knowledge and Reasoning
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
MMLU
Claude Sonnet 4.5
Accuracy
89.5
24
5d ago
Humanity's Last Exam (HLE) text-only
Gemini 3 Flash
sHLE Score
36.6
11
23d ago
CEval
Qwen3-Next-80B-A3B-Instruct
Accuracy
90.91
4
18d ago
General Tasks Suite BBH, MMLU, CMMLU, C-Eval
Qwen3-Inst
BBH
59.48
4
1mo ago
Showing 4 of 4 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task