Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Cheetah-Wind-E dynamics changes episodic

-41.6Average Return

Ada-Diffuser + IDQL (Oracle)

-107.328-90.264-73.2-56.136May 15, 2026
Updated 16d ago

Evaluation Results

MethodLinks
2026.05
-41.6
2026.05
-48.5
2026.05
-52
2026.05
-58.5
2026.05
-59
2026.05
-72.2
2026.05
-97.5
2026.05
-104.8