SOTA Symbolic Reasoning benchmarks and papers with code | Wizwand
Symbolic Reasoning
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
| --- | --- | --- | --- | --- | --- |
| Letter | Meta-Reasoning Paradigm | Accuracy | 92.4 | 67 | 25d ago |
| Last Letter Concatenation | Zero-Shot CoT | Accuracy | 90.4 | 58 | 25d ago |
| Coin | Meta-Reasoning Paradigm | Accuracy | 100 | 45 | 25d ago |
| Coin Flip | Auto-CoT | Accuracy | 99.9 | 27 | 25d ago |
| AQUA | COT | Accuracy | 80.3 | 26 | 1mo ago |
| OlyBench | QuaSAR | Accuracy | 36.5 | 25 | 1mo ago |
| MMLU Redux | QuaSAR | Accuracy | 80.3 | 25 | 1mo ago |
| Countdown | FDM-A | Accuracy | 49.61 | 24 | 1mo ago |
| Last Letter | In-Writing-BF | Accuracy | 0.819 | 21 | 1mo ago |
| Date | Automatic Model Selection with LLMs | Solve Rate | 90.5 | 14 | 1mo ago |
| Lies | Meta-Reasoning Paradigm | Accuracy | 100 | 12 | 1mo ago |
| LastLetter (test) | SGE | Accuracy | 86.96 | 11 | 1mo ago |
| Coin Flip | RM-Primed | Accuracy | 80.75 | 10 | 25d ago |
| Coinflip 4 | Self-consistency | Accuracy | 99.5 | 10 | 1mo ago |
| Date Understanding (DU) | SoftCoT | Accuracy | 87.2 | 10 | 1mo ago |
| CoinFlip | SC+IC (tune) | Calibrated Accuracy | 100 | 8 | 1mo ago |
| Last Letter Concatenation (test) | GPT-3.5-turbo | Accuracy | 81.9 | 8 | 1mo ago |
| Last Letter Concatenation (LLC) out-of-distribution (test) | StrategyLLM | LLC-4 Accuracy | 98 | 7 | 1mo ago |
| COLOR (Colored Object) | SATLM | Accuracy | 99.4 | 7 | 1mo ago |
| ARC-AGI 1 (test) | RIMA | Pass@2 | 47.5 | 6 | 1mo ago |
| Coin Flip OOD (test) | AMPLIFY | Accuracy | 65.7 | 6 | 1mo ago |
| Coin Flip (test) | Few-shot Auto-CoT | Accuracy | 98.6 | 6 | 1mo ago |
| Symbolic Longer | SCO | Accuracy (Clean, Avg) | 0.187 | 5 | 1mo ago |
| Symbolic Equal | CD-CoT | Acc (Clean, Avg) | 42.7 | 5 | 1mo ago |
| Date Understanding (DU) (test) | SoftCoT | Accuracy | 67.52 | 4 | 1mo ago |
Showing 25 of 42 rows