Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Grammar Synthesis on L2 a^n b^n c^m
Loading...
100
Accuracy
Llama 1B
2.968
28.159
53.35
78.541
Apr 12, 2026
Accuracy
Updated 5d ago
Evaluation Results
Method
Method
Links
Accuracy
Llama 1B
G=GASG
2026.04
100
Llama 1B
G=G^ ASG
2026.04
100
Llama 70B
G=GASG
2026.04
100
Llama 70B
G=G^ ASG
2026.04
100
o1
G=-
2026.04
96.7
o4 mini
G=-
2026.04
93.3
o3 mini
G=-
2026.04
86.7
DeepSeek-R1
G=-
2026.04
86.7
GPT 4.1
G=-
2026.04
76.7
Llama 70B
G=-
2026.04
53.3
Llama 1B
G=-
2026.04
6.7
Feedback
Search any
task
Search any
task