Program Synthesis on Karel (test)
[Figure: Generalization Accuracy and Exact Match Accuracy by date. Highlighted point: Exec, 86.04 Generalization Accuracy, Jun 29, 2021.]
Updated 1mo ago
Evaluation Results
Method        Setting       Date     Generalization Accuracy  Exact Match Accuracy
Exec          beam size=64  2021.06  86.04                    39.4
LaSynth       beam size=64  2021.06  83.68                    41.12
Shin et al.   beam size=64  2021.06  81.3                     42.8
Bunel et al.  beam size=64  2021.06  77.12                    32.17