Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Reinforcement Learning on halfcheetah medium-replay

68.4Normalized Score

LAWM

5.37621.73838.154.462Oct 10, 2023Mar 18, 2024Aug 25, 2024Feb 1, 2025Jul 11, 2025Dec 18, 2025May 27, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2025.12
68.4
2025.12
65
62
2026.05
57.1
2026.02
55.7
2023.10
55.1
2026.02
54.9
2026.05
54.1
2025.12
53.9
2026.02
51.4
2023.10
49.3
2026.05
48.9
2023.10
48.3
2023.10
48
2025.12
46.6
2024.02
46.2
2026.02
45.5
2026.02
45.3
2026.05
44.6
2023.10
44.2
2026.02
44.2
2026.05
44.1
2026.02
44
2026.05
42.9
2023.10
41.9
2026.05
41.3
2026.05
41.3
2026.05
40.4
2023.10
39.8
2026.05
39.6
2023.10
38.6
2023.10
38.2
2026.05
36.6
2024.02
35.6
2025.12
34.6
2024.02
34.1
2024.02
33
2024.02
32.8
2026.02
26.7
2025.12
25.1
2026.02
21.5
2026.02
20.1
2026.02
20
2025.12
19.4
2025.12
17.9
2026.02
17.6
2026.02
17.5
2025.12
14.8
2026.02
14.4
2025.12
14
2025.12
12.5
2025.12
9.5
2025.12
8.6
2024.02
7.8