CURL: Contrastive Unsupervised Representations for Reinforcement Learning
About
We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 1.9x and 1.2x performance gains at the 100K environment and interaction steps benchmarks respectively. On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency of methods that use state-based features. Our code is open-sourced and available at https://github.com/MishaLaskin/curl.
Aravind Srinivas, Michael Laskin, Pieter Abbeel• 2020
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multimodal Robotic Control | Fetch-PickAndPlace Patch corruptions (test) | Return1.43 | 42 | |
| Continuous Control | DMControl 500k | Spin Score926 | 33 | |
| Point-Goal navigation | Gibson (held-out scenes) | Average SR (All Scenes)1.14e+3 | 30 | |
| Control | DMControl | DMControl: Ball in Cup Catch Score888.4 | 29 | |
| Continuous Control | DMControl 100k | DMControl: Finger Spin Score767 | 29 | |
| Robot Manipulation | Fetch-Slide (test) | Return8.15 | 28 | |
| Reinforcement Learning | Atari100k (test) | Alien Score558.2 | 23 | |
| PointGoal Navigation | iGibson Ihlen 0 int 1.0 (test) | SR40.8 | 22 | |
| PointGoal Navigation | iGibson Rs int 1.0 (test) | Success Rate4.19e+3 | 22 | |
| PointGoal Navigation | iGibson Env Avg 1.0 (test) | SR3.14e+3 | 22 |
Showing 10 of 79 rows
...