Combinatorial Reasoning
[Chart: Graph Accuracy (axis up to 100), with Llama 70B marked at Mar 3, 2025. Metrics available: Graph Accuracy, Sudoku-3 Accuracy, Sudoku-4 Accuracy.]
Updated 1mo ago
Evaluation Results
| Model | Algorithm | Date | Graph Accuracy | Sudoku-3 Accuracy | Sudoku-4 Accuracy |
|---|---|---|---|---|---|
| Llama 70B | BoN | 2025.03 | 100 | 96.7 | 80 |
| Llama 1B | SEM-CTRL | 2025.03 | 100 | 100 | 100 |
| Llama 8B | SEM-CTRL | 2025.03 | 100 | 100 | 100 |
| Llama 70B | SEM-CTRL | 2025.03 | 100 | 100 | 100 |
| o4-mini | API | 2025.03 | 96.1 | 100 | 100 |
| o1-preview | API | 2025.03 | 92 | 100 | 100 |
| DeepSeek-R1 | API | 2025.03 | 85 | 100 | 100 |
| Llama 8B | BoN | 2025.03 | 52.4 | 70 | 80 |
| Llama 70B | Base | 2025.03 | 37.5 | 90 | 30 |
| Llama 8B | Base | 2025.03 | 25 | 40 | 10 |
| Llama 1B | BoN | 2025.03 | 24.3 | 0 | 0 |
| Llama 1B | Base | 2025.03 | 10.4 | 0 | 0 |