SOTA Multistep Reasoning benchmarks and papers with code | Wizwand
Multistep Reasoning
Benchmarks
| Dataset Name | SOTA Method | Metric | Results | Last Updated |
| --- | --- | --- | --- | --- |
| MuSR | Spurious Universal | Accuracy: 73.33 | 53 | 5d ago |
| MUSR | Base | Accuracy: 61.67 | 31 | 1mo ago |
| SpokenMQA multistep reasoning | Baseline (Thinking) | Accuracy: 81.5 | 6 | 3mo ago |
| MUSR-fr | Gamayun | Average Score: 33.79 | 6 | 3mo ago |