Share your thoughts, 1 month free Claude Pro on usSee more

Online Reinforcement Learning on DMControl FingerSpin (final)

903.92Normalized Return

GoRL(FM)

Updated 5mo ago

Evaluation Results

Method	Links
GoRL(FM) 2025.12		903.92
GoRL(Diff) 2025.12		844.74
DPPO 2025.12		694.06
PPO 2025.12		539.03
FPO 2025.12		56.05