
Offline-to-Online Reinforcement Learning on D4RL Gym-Locomotion

88.36 HalfCheetah Return (Random)

ROAD

[Chart: y-axis ticks 75.3184, 78.7042, 82.09, 85.4758; x-axis date May 14, 2026]
Updated 19d ago

Evaluation Results

Method  Links
2026.05  88.36  77.67  86.94  92.1   96.86  32.08  53.5   63.63  79.04  79.63  34.46  68.43  58.61  70.81  57.58
2026.05  87.35  62.57  78.86  89.68  93.86  15.93  34.47  48.84  67.07  52.22  23.01  49.1   39.33  50.2   64.74
2026.05  80.43  73.44  82.71  89.44  93.75  16.54  34.83  50.49  41.67  37.11  13.49  42.6   35.98  50.27  40.26
2026.05  80.07  71.28  83.53  88.14  93.17  16.82  35.95  40.51  66.29  48.57  22.67  52.17  42.27  39.78  60.24
2026.05  79.11  64.26  73.39  88.35  89.55  12.06  34.85  38.86  70.61  49.26  36.61  45.73  39.23  58.79  43.52
2026.05  78.16  72.68  83.27  89.9   93.61  15.77  38.89  40.42  66.36  37.64  21.65  40.81  33.57  49.53  41.82
2026.05  78.07  78.73  81.44  88.2   93.48  27.14  36.08  39.66  57.74  36.99  29.82  38.44  40.29  42.45  58.57
2026.05  76.94  72.51  83.37  91.08  96.28  31.63  42.21  37.47  23.03  89.06  17.81  47.3   41.43  48.83  49.84
2026.05  76.65  60.32  78.48  87.35  90.73  13.84  28.39  34.01  53.73  50.31  25.2   55.99  34.07  56.38  43.06
2026.05  75.82  74.2   85.3   89.28  94.44  28.2   36.24  41.55  61.69  46.23  20.79  55.97  34.55  66.13  62.31