Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Reinforcement Learning on antmaze medium-play

85.6Score

Q-ALIGN DT

-3.42419.68842.865.912Jun 6, 2022Feb 2, 2023Oct 2, 2023May 31, 2024Jan 28, 2025Sep 27, 2025May 27, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2026.05
85.6
2024.02
84.8
2024.02
84.6
2023.07
82.6
2024.02
81.6
2026.05
81.6
2024.02
80.2
2024.01
80
2026.05
78.6
2023.07
78
2023.07
77.6
2022.06
76.3
2023.07
75.8
75.3
2023.07
72.8
2023.07
71.8
2022.06
71.2
2024.01
71.2
2024.02
71.2
2026.05
71.2
2023.07
70.8
2023.07
63.2
2022.06
61.2
2024.01
61.2
2024.02
61.2
2026.05
61.2
2024.02
58.1
2022.06
35.3
2024.02
33.2
2026.05
33.2
2022.06
10.6
2024.02
10.6
2026.05
10.6
2024.02
4.5
2026.05
4.5
2024.02
4.3
2026.05
4.3
2023.07
0.6
2022.06
0
2022.06
0
2023.07
0
2024.01
0
2024.01
0
2024.01
0