OGBench: Benchmarking Offline Goal-Conditioned RL

About

Offline goal-conditioned reinforcement learning (GCRL) is a major problem in reinforcement learning (RL) because it provides a simple, unsupervised, and domain-agnostic way to acquire diverse behaviors and representations from unlabeled data without rewards. Despite the importance of this setting, we lack a standard benchmark that can systematically evaluate the capabilities of offline GCRL algorithms. In this work, we propose OGBench, a new, high-quality benchmark for algorithms research in offline goal-conditioned RL. OGBench consists of 8 types of environments, 85 datasets, and reference implementations of 6 representative offline GCRL algorithms. We have designed these challenging and realistic environments and datasets to directly probe different capabilities of algorithms, such as stitching, long-horizon reasoning, and the ability to handle high-dimensional inputs and stochasticity. While representative algorithms may rank similarly on prior benchmarks, our experiments reveal stark strengths and weaknesses in these different capabilities, providing a strong foundation for building new algorithms. Project page: https://seohong.me/projects/ogbench

Seohong Park, Kevin Frans, Benjamin Eysenbach, Sergey Levine • 2024

Related benchmarks

Task	Dataset	Result
Trajectory Stitching	pointmaze giant-stitch v0	Success Rate: 0.00e+0	21
Visuomotor Control	Push T	Success Rate: 33	21
Manipulation	OGBench cube-triple-play	Success Rate: 18	19
Robotic Planning	OGBench Scene 48 (play)	Success Rate: 0.42	16
Manipulation	OGBench cube-triple-noisy	Success Rate: 5	16
Manipulation	scene-play v0	Success Rate: 51	16
Robotic Planning	OGBench PointMaze Giant 48 (stitch)	Success Rate: 0.00e+0	16
Robotic Planning	OGBench AntMaze Giant 48 (stitch)	Success Rate: 0.00e+0	16
antmaze-medium-navigate	OGBench 100% offline dataset	Success Rate: 95	12
cube-single-play	OGBench 100% offline dataset	Success Rate: 0.68	12

Showing 10 of 41 rows
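
The success rates reported in the table above are, in general terms, the fraction of evaluation episodes in which a goal-conditioned policy reaches its commanded goal. The sketch below illustrates that computation on a toy grid world. It is not OGBench's evaluation code: the grid world, `greedy_policy`, `rollout`, `success_rate`, and the step budget are all hypothetical names and choices made for illustration only.

```python
from typing import Callable, Sequence, Tuple

State = Tuple[int, int]
Policy = Callable[[State, State], Tuple[int, int]]


def greedy_policy(state: State, goal: State) -> Tuple[int, int]:
    """Hypothetical policy: move one cell toward the goal, x-axis first."""
    if state[0] != goal[0]:
        return (1 if goal[0] > state[0] else -1, 0)
    if state[1] != goal[1]:
        return (0, 1 if goal[1] > state[1] else -1)
    return (0, 0)


def rollout(policy: Policy, start: State, goal: State, max_steps: int) -> bool:
    """Run one episode; return True if the goal is reached within the budget."""
    state = start
    for _ in range(max_steps):
        if state == goal:
            return True
        dx, dy = policy(state, goal)
        state = (state[0] + dx, state[1] + dy)
    return state == goal


def success_rate(
    policy: Policy, tasks: Sequence[Tuple[State, State]], max_steps: int
) -> float:
    """Fraction of (start, goal) tasks on which the policy succeeds."""
    successes = sum(rollout(policy, s, g, max_steps) for s, g in tasks)
    return successes / len(tasks)


# Assumed toy tasks: one goal within the step budget, one beyond it.
tasks = [((0, 0), (2, 1)), ((0, 0), (5, 5))]
rate = success_rate(greedy_policy, tasks, max_steps=5)
print(f"success rate: {rate:.2f}")
```

A fraction computed this way lies in [0, 1]; multiplying by 100 gives a percentage, so figures from different sources should be checked for which convention they use before being compared.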
