Online Reinforcement Learning on HopperStand DMControl (final)
Best Normalized Return: 874.63, GoRL(Diff) (Dec 2, 2025)
Updated 3mo ago
Evaluation Results
Method      Details                     Date     Normalized Return
GoRL(Diff)  Seeds=5, Decoder=Diffu...   2025.12  874.63
GoRL(FM)    Seeds=5, Decoder=Flow-...   2025.12  733.66
PPO         Seeds=5                     2025.12  286.09
FPO         Seeds=5                     2025.12  3.94
DPPO        Seeds=5                     2025.12  2.14