Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Cheetah-Wind-S dynamics changes (time-step)

-42.3Average Return

Ada-Diffuser + DP (Oracle)

-123.732-102.591-81.45-60.309May 15, 2026
Updated 16d ago

Evaluation Results

MethodLinks
2026.05
-42.3
2026.05
-44.7
2026.05
-48
2026.05
-52.9
2026.05
-63.4
2026.05
-76.5
2026.05
-87.8
2026.05
-120.6