Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Reinforcement Learning on walker2d medium

1,248Normalized Score

QDFM

-68.64273.18615956.82Oct 10, 2023Mar 18, 2024Aug 25, 2024Feb 1, 2025Jul 11, 2025Dec 18, 2025May 27, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2026.02
1,248
2026.02
930
2026.02
665
2026.02
659
2026.02
248
2026.05
94.7
2026.02
92.4
2026.02
91.6
2025.12
91.4
2023.10
89.6
89.3
2026.05
88.8
2026.05
88.2
2026.02
87.6
2026.02
84.9
2026.05
83.8
2026.05
83.7
2023.10
81.9
2023.10
81.7
2026.05
81
2023.10
80.8
2026.05
79.9
2024.02
79.2
2026.05
79.2
2026.05
79.1
2023.10
78.3
2026.02
78.3
2023.10
77.2
2026.02
76.56
2026.02
75.77
2026.05
74
2024.02
73.3
2026.02
73.3
2025.12
71.5
2025.12
71.1
2024.02
69.9
2025.12
68.7
2026.02
68.2
2024.02
67.1
2025.12
64.7
2025.12
64.3
2024.02
63.7
2026.02
60.22
2023.10
59.1
2026.02
55.96
2026.02
54.6
2023.10
53.1
2026.02
51.89
2026.02
50.1
2026.02
39.16
2026.02
30.69
2025.12
24.3
2025.12
23.7
2023.10
21.8
2025.12
19.3
2025.12
16.4
2025.12
15.8
2025.12
14.3
2025.12
10.6
2024.02
0
2026.02
-18