Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Pendulum PD-C (test)

854Cumulative Reward

SA-DT

-1,430.88-837.69-244.5348.69Mar 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
854
2026.03
323
2026.03
310
2026.03
191
2026.03
-1,251
2026.03
-1,343