Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Reinforcement Learning on hopper medium-replay

113Normalized Score

Proposed

-2.12827.76157.6587.539Jun 16, 2021Mar 28, 2022Jan 7, 2023Oct 19, 2023Jul 30, 2024May 11, 2025Feb 21, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2023.10
113
103.1
2023.10
102.5
2026.02
101.9
2026.02
100.5
2026.02
99.9
2026.02
97.5
2023.10
94.7
2026.02
94.7
2021.06
94.1
2023.10
89.5
2026.02
84.43
2026.02
83.99
2021.06
77.3
2026.02
75.22
2021.06
71
2026.02
70.06
2024.02
63.7
2026.02
56.96
2024.02
52.1
2024.02
50.8
2026.02
50.79
2024.02
48.6
2021.06
48.6
2026.02
46.7
2023.10
44.4
2026.02
39.22
2024.02
38.7
2023.10
36.4
2026.02
35.9
2023.10
33.7
2023.10
33.1
2026.02
33.03
2023.10
32.6
2026.02
30.17
2021.06
21.2
2024.02
7.7
2025.12
7.4
2025.12
5.6
2025.12
4.6
2025.12
4.1
2025.12
3.7
2025.12
3.5
2025.12
2.3