Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Reinforcement Learning on halfcheetah medium v2

4,452Average Score

LRT+Q

-176.7281,024.9612,226.653,428.339Jun 6, 2022Jan 21, 2023Sep 8, 2023Apr 25, 2024Dec 11, 2024Jul 29, 2025Mar 16, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2025.10
4,452-----5.02
2025.10
4,404-----5.11
2025.10
3,526-----3.61
2026.03
66.4------
2026.03
58.5------
2026.03
52.6------
2026.03
51.1------
2026.03
50.4------
2026.03
49.08------
2026.03
45.2------
2026.03
44.4------
2026.03
43.85------
2022.06
43.658.649.53843.528.2-
2026.03
42.8------
2026.03
42.64------
2022.06
41.160.145.634.239.825.7-
2022.06
4059.244.53338.125-
2026.03
39.27------
2022.06
37.257.444.529.73717.7-
2026.03
36.1------
2022.06
33.457.938.325.130.814.9-
2022.06
32.2573723.928.714.4-
2026.03
5.8------
2026.03
2.4------
2026.03
2------
2026.03
1.5------
2026.03
1.3------