Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Language Reasoning benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Language Reasoning
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
TruthfulQA
FP16
Accuracy
40.15
12
1mo ago
Language Reasoning Average
FP16
Accuracy
73.25
12
1mo ago
BBH (BIG-Bench Hard)
MIPROv2
Object Counting Score
99.4
8
27d ago
LangR unseen tasks (test)
SGE
Pass@1
60.8
3
1mo ago
Showing 4 of 4 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task