Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Circle Packing on CirclePacking T
Loading...
2.636
Best Score
ThetaEvolve (w/ RL)
0.892753
1.345323
1.797893
2.250463
Nov 28, 2025
Best Score
Mean Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Best Score
Mean Score
ThetaEvolve (w/ RL)
Model=Distill-Qwen3-8B...
2025.11
2.636
2.636
ThetaEvolve (w/o RL late)
Model=Distill-Qwen3-8B...
2025.11
2.636
2.636
ThetaEvolve (w/o RL early)
Model=Distill-Qwen3-8B...
2025.11
2.636
2.6354
ThetaEvolve (w/ RL)
Model=ProRL-1.5B-v2, S...
2025.11
2.5225
2.3498
ThetaEvolve (w/o RL late)
Model=ProRL-1.5B-v2, S...
2025.11
2.2491
2.0991
ThetaEvolve (w/o RL early)
Model=ProRL-1.5B-v2, S...
2025.11
2.1343
2.0265
Initial
Model=ProRL-1.5B-v2, S...
2025.11
0.9598
-
Initial
Model=Distill-Qwen3-8B...
2025.11
0.9598
-
Feedback
Search any
task
Search any
task