CURL: Contrastive Unsupervised Representations for Reinforcement Learning

About

We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 1.9x and 1.2x performance gains at the 100K environment and interaction steps benchmarks respectively. On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency of methods that use state-based features. Our code is open-sourced and available at https://github.com/MishaLaskin/curl.

Aravind Srinivas, Michael Laskin, Pieter Abbeel• 2020

Related benchmarks

Task	Dataset	Result
Multimodal Robotic Control	Fetch-PickAndPlace Patch corruptions (test)	Return1.43	42
Continuous Control	DMControl 500k	Spin Score926	42
Reinforcement Learning	Atari 100k	Alien Score711	41
Continuous Control	DMControl 100k	DMControl: Finger Spin Score767	38
Point-Goal navigation	Gibson (held-out scenes)	Average SR (All Scenes)1.14e+3	37
Control	DMControl	DMControl: Ball in Cup Catch Score888.4	29
Robot Manipulation	Fetch-Slide (test)	Return8.15	28
Reinforcement Learning	Atari100k (test)	Alien Score558.2	23
PointGoal Navigation	iGibson Ihlen 0 int 1.0 (test)	SR40.8	22
PointGoal Navigation	iGibson Rs int 1.0 (test)	Success Rate4.19e+3	22

Showing 10 of 80 rows

...

Other info

Code

Follow for update

@wizwand_team Discord