Reinforcement Learning on AntMaze Large Play
[Chart: OSR and SR by date. Top OSR: 46.7 (RankQ), May 11, 2026.]
Updated 21d ago
Evaluation Results
Method     Variant  Date     OSR   SR
RankQ      +SAC     2026.05  46.7  0
CQL        +SAC     2026.05  43.3  0
Cal-QL     -        2026.05  36.7  67.7
RankQ      -        2026.05  36.7  91.2
Cal-QL     +SAC     2026.05  30    0
CQL        -        2026.05  28.3  82.8
Hybrid RL  -        2026.05  0     0
SAC        -        2026.05  0     0
SAC        +OFF     2026.05  0     0