Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Synthetic Grammar Synthesis on Synthetic Grammar Synthesis (a^m b^n c^m d^n)
Loading...
100
Accuracy
SEM-CTRL
-4
23
50
77
Mar 3, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
SEM-CTRL
Algorithm=SEM-CTRL, Mo...
2025.03
100
SEM-CTRL
Algorithm=SEM-CTRL, Mo...
2025.03
100
o4-mini
Algorithm=API, Model=o...
2025.03
93.3
o1-preview
Algorithm=API, Model=o...
2025.03
80
DeepSeek-R1
Algorithm=API, Model=D...
2025.03
70
BoN
Algorithm=BoN, Model=L...
2025.03
22.2
BoN
Algorithm=BoN, Model=L...
2025.03
1.1
Base
Algorithm=Base, Model=...
2025.03
0
Base
Algorithm=Base, Model=...
2025.03
0
Feedback
Search any
task
Search any
task