Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on Pendulum PD-C (test)
Loading...
854
Cumulative Reward
SA-DT
-1,430.88
-837.69
-244.5
348.69
Mar 11, 2026
Cumulative Reward
Updated 1mo ago
Evaluation Results
Method
Method
Links
Cumulative Reward
SA-DT
Depth (d)=8, Number of...
2026.03
854
SYMPOL
Number of random trial...
2026.03
323
SDT
Number of random trial...
2026.03
310
MLP
Number of random trial...
2026.03
191
SA-DT
Depth (d)=5, Number of...
2026.03
-1,251
D-SDT
Number of random trial...
2026.03
-1,343
Feedback
Search any
task
Search any
task