SOTA Symbolic Reasoning benchmarks and papers with code | Wizwand
Symbolic Reasoning
Benchmarks
| Dataset Name | SOTA Method | Metric | Results | Last Updated |
|---|---|---|---|---|
| Last Letter Concatenation | Zero-Shot CoT | Accuracy: 90.4 | 68 | 20d ago |
| Letter | Meta-Reasoning Paradigm | Accuracy: 92.4 | 67 | 2mo ago |
| Coin | Meta-Reasoning Paradigm | Accuracy: 100 | 45 | 2mo ago |
| Last Letter | In-Writing-BF | Accuracy: 0.819 | 31 | 20d ago |
| Coin Flip | Auto-CoT | Accuracy: 99.9 | 27 | 2mo ago |
| AQUA | COT | Accuracy: 80.3 | 26 | 3mo ago |
| OlyBench | QuaSAR | Accuracy: 36.5 | 25 | 3mo ago |
| MMLU Redux | QuaSAR | Accuracy: 80.3 | 25 | 3mo ago |
| Countdown | FDM-A | Accuracy: 49.61 | 24 | 3mo ago |
| Date | Automatic Model Selection with LLMs | Solve Rate: 90.5 | 14 | 3mo ago |
| Lies | Meta-Reasoning Paradigm | Accuracy: 100 | 12 | 3mo ago |
| LastLetter (test) | SGE | Accuracy: 86.96 | 11 | 3mo ago |
| Coin Flip | RM-Primed | Accuracy: 80.75 | 10 | 2mo ago |
| Coinflip 4 | Self-consistency | Accuracy: 99.5 | 10 | 3mo ago |
| Date Understanding (DU) | SoftCoT | Accuracy: 87.2 | 10 | 3mo ago |
| CoinFlip | SC+IC (tune) | Calibrated Accuracy: 100 | 8 | 3mo ago |
| Last Letter Concatenation (test) | GPT-3.5-turbo | Accuracy: 81.9 | 8 | 3mo ago |
| Last Letter Concatenation (LLC) out-of-distribution (test) | StrategyLLM | LLC-4 Accuracy: 98 | 7 | 3mo ago |
| COLOR (Colored Object) | SATLM | Accuracy: 99.4 | 7 | 3mo ago |
| ARC-AGI 1 (test) | RIMA | Pass@2: 47.5 | 6 | 2mo ago |
| Coin Flip OOD (test) | AMPLIFY | Accuracy: 65.7 | 6 | 3mo ago |
| Coin Flip (test) | Few-shot Auto-CoT | Accuracy: 98.6 | 6 | 3mo ago |
| Symbolic Longer | SCO | Accuracy (Clean, Avg): 0.187 | 5 | 3mo ago |
| Symbolic Equal | CD-CoT | Acc (Clean, Avg): 42.7 | 5 | 3mo ago |
| Date Understanding (DU) (test) | SoftCoT | Accuracy: 67.52 | 4 | 3mo ago |
Showing 25 of 43 rows