Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on CartPole (CP) (test)
Loading...
500
Cumulative Reward
SYMPOL
113.12
213.56
314
414.44
Mar 11, 2026
Cumulative Reward
Updated 1mo ago
Evaluation Results
Method
Method
Links
Cumulative Reward
SYMPOL
Number of random trial...
2026.03
500
MLP
Number of random trial...
2026.03
500
SDT
Number of random trial...
2026.03
500
SA-DT
Depth (d)=8, Number of...
2026.03
476
SA-DT
Depth (d)=5, Number of...
2026.03
446
D-SDT
Number of random trial...
2026.03
128
Feedback
Search any
task
Search any
task