Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Reinforcement Learning on D4RL Medium-Replay Hopper

110.6Normalized Score

NEUBAY

34.6854.3974.193.81May 24, 2023Nov 23, 2023May 24, 2024Nov 23, 2024May 25, 2025Nov 24, 2025May 27, 2026
Updated 6d ago

Evaluation Results

MethodLinks
2025.12
110.6
2026.05
110
2025.12
109.9
2025.12
109.6
2026.03
106.6
2026.03
106.2
2023.10
106.2
2026.05
106.2
2025.12
104.4
2026.03
104.4
2026.05
104.4
2025.12
104.3
2025.12
104
2025.12
103.9
2025.12
103.9
2025.12
103.5
2023.10
102.5
2026.03
101.9
2026.05
101.9
2026.05
101.9
2023.10
101.7
2026.05
101.2
2025.12
101
2023.10
101
2023.06
100
100
2026.05
99.8
2026.03
99.5
2026.05
99.5
2023.10
99.4
2025.12
98.5
2025.12
98.3
2025.12
98.2
2026.03
97.8
2026.05
97.8
2024.04
97.4
2023.06
96.8
2025.12
96.8
2023.10
96.6
2023.06
95.3
2023.06
95
2026.02
95
2026.02
95
2023.06
95
2023.06
94.7
2023.10
94.7
2025.12
94.2
2023.06
93.6
2025.12
92.8
2025.12
92.5
2025.12
92.1
2023.06
91.5
2023.06
91.5
89.5
2025.12
89.5
2025.12
89.5
2023.10
89.5
2026.02
89.4
2023.06
89.1
2024.04
88.9
2026.02
87.8
2023.06
87.3
86.9
2025.12
86.3
2026.03
86.3
2026.05
86.3
2023.06
86.1
2025.12
85
2023.05
84.54
2023.06
84.4
2023.06
83.7
2023.05
83.06
2023.06
82.7
2025.12
82.7
2023.06
82.7
2023.06
82.6
2026.02
82.5
2025.12
82.4
2024.04
82
2026.03
81.8
2026.05
81.8
2026.02
81.7
2023.06
80.9
2023.06
78.3
2023.05
73.57
2023.05
70.2
2026.05
64.8
2023.05
62
2026.02
60.9
2023.05
57.88
2024.04
56.2
2025.12
54.9
2025.12
53.5
2026.02
52.1
2025.12
51.6
2025.12
51.3
2026.02
49.6
2024.04
45.6
2025.12
40.6
2025.12
37.6
Showing 100 of 109 rows