Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline-to-online Reinforcement Learning on D4RL antmaze-medium-play
Loading...
81.7
OSR
CQL
-3.268
18.791
40.85
62.909
May 11, 2026
OSR
SR
Updated 21d ago
Evaluation Results
Method
Method
Links
OSR
SR
CQL
2026.05
81.7
98
RankQ
variant=+SAC
2026.05
81.7
98.7
CQL
variant=+SAC
2026.05
78.3
98.3
Cal-QL
variant=+SAC
2026.05
75
98.9
RankQ
2026.05
75
97.7
Cal-QL
2026.05
71.7
97.8
Hybrid RL
2026.05
13.3
98.3
SAC
variant=+OFF
2026.05
11.7
98.8
SAC
2026.05
0
0
Feedback
Search any
task
Search any
task