Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Planning on Blocks (Blocksworld)
Loading...
100
Accuracy
Stick-Breaking Transformer
-4
23
50
77
Mar 3, 2025
Apr 4, 2025
May 7, 2025
Jun 9, 2025
Jul 12, 2025
Aug 14, 2025
Sep 16, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Stick-Breaking Transformer
encoding=init-atoms
2025.09
100
Sinusoidal
encoding=init-atoms
2025.09
100
RoPE
encoding=init-atoms
2025.09
100
o4-mini
Algorithm=API, Model=o...
2025.03
98.5
SEM-CTRL
Algorithm=SEM-CTRL, Mo...
2025.03
96.8
DeepSeek-R1
Algorithm=API, Model=D...
2025.03
96.5
o1-preview
Algorithm=API, Model=o...
2025.03
94.5
SEM-CTRL
Algorithm=SEM-CTRL, Mo...
2025.03
74
BoN
Algorithm=BoN, Model=L...
2025.03
48.8
Base
Algorithm=Base, Model=...
2025.03
23.2
BoN
Algorithm=BoN, Model=L...
2025.03
4.3
Base
Algorithm=Base, Model=...
2025.03
0
Feedback
Search any
task
Search any
task