Offline Reinforcement Learning on hopper medium-replay

Normalized Score: 113

[Chart: Normalized Score (series: Proposed), Jun 16, 2021 to Feb 21, 2026]
Updated 4d ago

Evaluation Results

Date       Normalized Score
2023.10    113
           103.1
2023.10    102.5
2026.02    101.9
2026.02    100.5
2026.02    99.9
2026.02    97.5
2023.10    94.7
2026.02    94.7
2021.06    94.1
2023.10    89.5
2026.02    84.43
2026.02    83.99
2021.06    77.3
2026.02    75.22
2021.06    71
2026.02    70.06
2024.02    63.7
2026.02    56.96
2024.02    52.1
2024.02    50.8
2026.02    50.79
2024.02    48.6
2021.06    48.6
2026.02    46.7
2023.10    44.4
2026.02    39.22
2024.02    38.7
2023.10    36.4
2026.02    35.9
2023.10    33.7
2023.10    33.1
2026.02    33.03
2023.10    32.6
2026.02    30.17
2021.06    21.2
2024.02    7.7
2025.12    7.4
2025.12    5.6
2025.12    4.6
2025.12    4.1
2025.12    3.7
2025.12    3.5
2025.12    2.3