SOTA Logic reasoning benchmarks and papers with code | Wizwand
Logic reasoning
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
|---|---|---|---|---|---|
| Tracking Shuffled Objects BBH | Role-Play Prompting | Accuracy | 71.33 | 59 | 14d ago |
| Zebralogic | Qwen 3 VL 32B Think | Score | 96.1 | 42 | 3mo ago |
| Causal Judgement | Self-discover | Accuracy | 36 | 30 | 3mo ago |
| K&K Logic Puzzles OOD | o3-mini-high | Score Threshold 2 (OOD) | 99 | 25 | 2mo ago |
| K&K Logic Puzzles In-domain | o3-mini-high | Accuracy (Level 3) | 98 | 25 | 2mo ago |
| LogicVista | Qwen-8B-DeltaThinker | LogicVista Accuracy | 61.97 | 16 | 2d ago |
| Autologic en | DARL | Score | 0.439 | 16 | 3mo ago |
| Autologic cn | DARL | Score | 40.3 | 16 | 3mo ago |
| ZebraLogic | Baseline (Thinking) | Accuracy | 96 | 15 | 3mo ago |
| ZebraLogic | NPR | Avg Accuracy @1 | 0.817 | 11 | 3mo ago |
| ARC (eval) | NSA (ours) | Tasks Solved | 75 | 10 | 1d ago |
| ARC (train) | Ainooson Brute Force [2] | Tasks Solved | 26 | 9 | 1d ago |
| Sudoku 8B Instruct (test) | Prob Margin | Accuracy | 71.7 | 9 | 1mo ago |
| MMStar | BAGEL+Ours | MMStar Score | 67.9 | 8 | 16d ago |
| Zebra riddles (test) | Glauber-UL2 (N=3) | Accuracy | 98.7 | 7 | 27d ago |
| Riddle 1.0 (test) | INMS | F1 Score | 69 | 7 | 2mo ago |
| Pun 1.0 (test) | BM25 | F1 Score | 41 | 7 | 2mo ago |
| Puzzle 1.0 (test) | BM25 | F1 Score | 19 | 7 | 2mo ago |
| Logic Reasoning Suite LogicVista, VisuLogic | Uni-OPD | Accuracy on LogicVista | 54 | 6 | 28d ago |
| Zebralogic | SUPERNOVA-4B | Pass@8 | 77 | 6 | 1mo ago |
| Visu Logic | Uni-OPD | Visu Logic Score | 28 | 4 | 28d ago |
| ARC-Challenge & LogiQA OpenCompass (test) | CRITIQ | ARC-C Accuracy | 38.31 | 4 | 3mo ago |
| Large-scale model pool Logic Reasoning 15 LLMs | RouteMoA | Accuracy | 95.6 | 3 | 3mo ago |
| CommonsenseQA | MIG | Pass@1 | 69.8 | 3 | 3mo ago |
| KiVA | BAGEL+Ours | KiVA Score | 35.2 | 2 | 16d ago |
Showing 25 of 26 rows