Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on Cheetah-Wind-S dynamics changes (time-step)
Loading...
-42.3
Average Return
Ada-Diffuser + DP (Oracle)
-123.732
-102.591
-81.45
-60.309
May 15, 2026
Average Return
Updated 16d ago
Evaluation Results
Method
Method
Links
Average Return
Ada-Diffuser + DP (Oracle)
Base Algorithm=DP, Lat...
2026.05
-42.3
Ada-Diffuser + IDQL (Oracle)
Base Algorithm=IDQL, L...
2026.05
-44.7
Ada-Diffuser + IDQL
Base Algorithm=IDQL, L...
2026.05
-48
Ada-Diffuser + DP
Base Algorithm=DP, Lat...
2026.05
-52.9
IDQL + DynaMITE
Base Algorithm=IDQL, L...
2026.05
-63.4
DP + DynaMITE
Base Algorithm=DP, Lat...
2026.05
-76.5
IDQL
Base Algorithm=IDQL, L...
2026.05
-87.8
DP
Base Algorithm=DP, Lat...
2026.05
-120.6
Feedback
Search any
task
Search any
task