Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Offline Reinforcement Learning on D4RL halfcheetah-medium-expert

110Normalized Score

VIPO

52.38467.34282.397.258Jun 16, 2021Mar 28, 2022Jan 8, 2023Oct 20, 2023Aug 1, 2024May 13, 2025Feb 23, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
110-
2025.12
109.5-
2022.10
108.5-
2022.10
108.5-
2024.11
108.2-
2025.12
108.2-
2025.12
107.9-
2025.12
106.6-
2022.10
106.3-
2025.12
106.3-
2025.12
105.4-
2024.11
104.4-
2025.12
103.7-
2025.12
103.4-
2024.11
103.2-
2025.12
102.8-
2025.12
101.4-
2023.10
98.5-
2025.12
98.4-
2025.12
96.9-
2024.11
96.8-
2024.11
96.6-
2023.10
95.9-
95.7-
2025.12
95.7-
2021.10
95.6-
2022.10
95-
2022.10
95-
2023.06
95-
2024.04
95-
2025.12
95-
2023.10
94.8-
2024.04
94.7-
2025.12
94.4-
2025.12
93.7-
2021.06
93.5-
2026.02
93.1-
2026.02
92.7-
2021.10
92.6-
2026.02
92.6-
2021.06
91.7-
2023.06
91.6-
2026.02
91.6-
2023.06
91.5-
2023.10
91.1-
2021.10
91-
2021.10
90.8-
2025.12
90.8-
2026.02
90.7-
2023.06
90.6-
2024.05
90.6-
90.6-
2026.02
90.6-
2024.04
90.5-
2023.10
90-
2021.10
90-
2024.05
90-
2025.12
90-
2025.12
90-
2026.02
89.7-
2024.04
89.6-
2024.05
89.6-
2021.10
89.5-
2024.04
88.9-
2024.05
88.9-
2021.10
86.8-
2023.06
86.8-
2024.04
86.8-
2025.12
86.8-
2023.10
86.7-
2023.06
86.7-
2026.02
85.5-
2024.11
83.6-
2024.11
82.5-
2022.10
80.4-
2023.06
79.8-
2025.12
79.8-
2026.02
79-
2022.10
78-
2021.10
77.5-
2021.06
77-
2021.10
72.7-
2025.12
70.1-
2024.10
67.6-
2023.10
66.7-
2023.10
64.7-
2021.06
64.7-
2024.10
64.4-
2024.05
63.3-
2025.12
63-
2022.10
62.9-
2025.12
61.9-
2021.06
60.1-
2022.10
60.1-
2025.12
59.2-
2024.04
56-
2025.12
55.4-
2023.06
55.2-
2025.12
55.2-
2025.12
54.6-
Showing 100 of 134 rows