Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on AntMaze large-play D4RL
Loading...
533
Average Episodic Return
GC-Oracle
8.632
144.766
280.9
417.034
Mar 10, 2024
Jul 15, 2024
Nov 20, 2024
Mar 28, 2025
Aug 2, 2025
Dec 8, 2025
Apr 15, 2026
Average Episodic Return
Updated 3d ago
Evaluation Results
Method
Method
Links
Average Episodic Return
GC-Oracle
2024.03
533
DiSPO
2024.03
306
USFA
2024.03
250
COMBO
2024.03
248
GC-IQL
2024.03
229
FB
2024.03
165
RaMP
2024.03
134
MOPO
2024.03
128
IQL
Learning Phase=Offline RL
2026.04
38.5
O2O-LSVI
Learning Phase=Offline...
2026.04
35.3
Cal-QL
Learning Phase=Offline...
2026.04
31.8
CQL
Learning Phase=Offline RL
2026.04
28.8
Feedback
Search any
task
Search any
task