Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Generalized Planning on Childsnack

100Scale

Explicitly regularized Q-value policy (Ω^Exp.)

5.3629.9354.579.07Mar 18, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
10093
10093
10065.8
9846.9
8341.5
8243.1
2026.03
7143.4
2026.03
3614.9
2026.03
3115.9
2026.03
2612
2026.03
164.2
2026.03
90.2