Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline-to-online Reinforcement Learning on D4RL Hopper medium discretized
Loading...
47.9
Online Normalized Score
DRIFT
-1.5
11.325
24.15
36.975
May 12, 2026
Online Normalized Score
Offline Normalized Score
Updated 21d ago
Evaluation Results
Method
Method
Links
Online Normalized Score
Offline Normalized Score
DRIFT
discretisation=k-means...
2026.05
47.9
0.1
Cal-QL
discretisation=k-means...
2026.05
44.1
28.8
PEX
discretisation=k-means...
2026.05
43.1
0.4
CQL
discretisation=k-means...
2026.05
35.8
28.8
IQL
discretisation=k-means...
2026.05
27.9
0.4
AWAC
discretisation=k-means...
2026.05
25.3
26
DQN
discretisation=k-means...
2026.05
23.7
0.4
PPO
discretisation=k-means...
2026.05
3.1
-
SPA
discretisation=k-means...
2026.05
0.4
0.4
Feedback
Search any
task
Search any
task