Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Program Synthesis on PSB1 (train)
Loading...
100
Compare String Lengths
HOTGP
8.48
32.24
56
79.76
Nov 28, 2025
Compare String Lengths
Checksum
Count Odds
Digits
Double Letters
Even Squares
For Loop Index
Grade
Last Index of Zero
Median
Mirror Image
Negative to Zero
Number IO
Pig Latin
Replace Space with Newline
Scrabble Score
Small or Large
Smallest
String Lengths Backwards
Sum of Squares
Super Anagrams
Syllables
Vector Average
Vectors Summed
X Word Lines
# Best Results
Equals 100%
Greater Than or Equal to 75%
Greater Than or Equal to 50%
Greater Than 0%
Updated 4d ago
Evaluation Results
Method
Method
Links
Compare String Lengths
Checksum
Count Odds
Digits
Double Letters
Even Squares
For Loop Index
Grade
Last Index of Zero
Median
Mirror Image
Negative to Zero
Number IO
Pig Latin
Replace Space with Newline
Scrabble Score
Small or Large
Smallest
String Lengths Backwards
Sum of Squares
Super Anagrams
Syllables
Vector Average
Vectors Summed
X Word Lines
# Best Results
Equals 100%
Greater Than or Equal to 75%
Greater Than or Equal to 50%
Greater Than 0%
HOTGP
Split=Training
2025.11
100
-
46
-
0
0
73
37
0
82
1
100
100
-
38
-
28
98
87
1
-
0
78
34
-
-
3
7
8
15
HOTGP
Split=Training, Simpli...
2025.11
100
-
50
-
0
0
73
39
0
100
1
100
100
-
38
-
28
100
89
1
-
0
80
37
-
-
5
7
9
15
G3P+
Split=Training
2025.11
96
0
4
0
0
1
8
63
97
99
89
24
95
4
29
1
39
100
20
5
43
53
5
28
0
0
1
6
8
19
G3Phs
Split=Training
2025.11
94
-
-
-
-
-
-
-
0
100
-
0
100
-
-
-
30
100
0
-
30
-
67
100
-
1
4
5
6
8
G3Ppy
Split=Training
2025.11
12
-
-
-
-
-
-
-
2
39
-
68
100
-
-
-
0
99
35
-
51
-
0
0
-
1
1
2
4
8
Feedback
Search any
task
Search any
task