Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Online Reinforcement Learning on CheetahRun DMControl (final)
Loading...
902.24
Normalized Return
GoRL(Diff)
546.092
638.5535
731.015
823.4765
Dec 2, 2025
Normalized Return
Updated 3mo ago
Evaluation Results
Method
Method
Links
Normalized Return
GoRL(Diff)
Seeds=5, Decoder=Diffu...
2025.12
902.24
GoRL(FM)
Seeds=5, Decoder=Flow-...
2025.12
883.4
PPO
Seeds=5
2025.12
724.83
FPO
Seeds=5
2025.12
599.15
DPPO
Seeds=5
2025.12
559.79
Feedback
Search any
task
Search any
task